HUMAN HYDROLASE PROTEINS

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of hydrolase proteins and to 
the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative,
autoimmune/inflammatory, neurological, renal, adrenal, and genetic disorders. 



BACKGROUND OF THE INVENTION 

Hydrolysis is the breaking of a covalent bond in a substrate by introduction of a water
molecule. The reaction involves a nucleophilic attack by the water molecule's oxygen atom on a
target bond in the substrate. The water molecule is split across the target bond, breaking the bond and
generating two product molecules. Hydrolases participate in reactions essential to functions such as
cell signaling, cell proliferation, inflammation, apoptosis, secretion and excretion. Hydrolases are
involved in key steps in disease processes involving these functions. Hydrolases, or hydrolytic
enzymes, may be grouped by substrate specificity into classes including aminohydrolases,
phospholipases, carboxyl-esterases, phosphodiesterases, glycosidases, glyoxalases, sulfatases,
phosphohydrolases, serine hydrolases, and lysozymes.

NG,NG-dimethylarginine dimethylaminohydrolase (DDAH) is an enzyme that hydrolyzes the
endogenous nitric oxide synthase (NOS) inhibitors, NG-monomethyl-arginine and NG,NG-dimethyl-L-arginine,
to L-citrulline. Inhibiting DDAH can cause increased intracellular concentration of NOS
inhibitors to levels sufficient to inhibit NOS. Therefore, DDAH inhibition may provide a method of
NOS inhibition, and changes in the activity of DDAH could play a role in pathophysiological
alterations in nitric oxide generation (MacAllister, R.J. et al. (1996) Br. J. Pharmacol. 119:1533-1540).
DDAH was found in neurons displaying cytoskeletal abnormalities and oxidative stress in
Alzheimer's disease. In age-matched control cases, DDAH was not found in neurons. This suggests
that oxidative stress- and nitric oxide-mediated events play a role in the pathogenesis of Alzheimer's
disease (Smith, M.A. et al. (1998) Free Rad. Biol. Med. 25:898-902).

Phosphodiesterases catalyze the hydrolysis of one of the two ester bonds in a phosphodiester
compound. Phosphodiesterases are, therefore, crucial to a variety of cellular processes.
Phosphodiesterases include DNA and RNA endo- and exo-nucleases, which are essential to cell
growth and replication as well as protein synthesis.

Pancreatic lipase and colipase form a complex that plays a key role in dietary fat digestion by
converting insoluble long chain triacylglycerols into more polar molecules able to cross the brush
border of intestinal cells. Colipase binds to the C-terminal domain of lipase. In solution, this
interaction involves the formation of an ion pair between a glutamic acid residue of colipase and a
lysine residue of lipase. These residues are strictly conserved among species (Ayvazian, L. et al.
(1998) J. Biol. Chem. 273:33604-33609). Colipase appears to overcome the inhibitory effects of bile
salts on pancreatic lipase (OMIM 246600 on April 28, 1999).

Carboxylesterases are proteins that hydrolyze carboxylic esters and are classified into three
categories: A, B, and C. Most type-B carboxylesterases are evolutionarily related and are considered
to comprise a family of proteins. The type-B carboxylesterase family of proteins includes vertebrate
acetylcholinesterase, mammalian liver microsomal carboxylesterase, mammalian bile-salt-activated
lipase, and duck fatty acyl-CoA hydrolase. Some members of this protein family, such as
thyroglobulin and the Drosophila protein neurotactin, are not catalytically active but contain a
domain related evolutionarily to other type-B carboxylesterases. The active site of carboxylesterases
involves three residues: a serine, a glutamate or aspartate, and a histidine. The sequence surrounding this
catalytic site is well conserved and can be used as a signature pattern (PROSITE: PDOC00112 at
www.expasy.ch/cgi-bin/get-prodoc-entry on May 11, 1999).
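As an illustration of how a signature pattern of this kind can be applied to a sequence, the following minimal Python sketch converts a PROSITE-style pattern into a regular expression and searches a sequence with it. The pattern and sequence shown are toy values chosen for illustration only; they are assumptions and are not the PROSITE entry cited above, and the function name `prosite_to_regex` is hypothetical. Only a subset of the PROSITE syntax is handled (literal residues, `x`, `[...]` classes, `{...}` exclusions, and repeat counts).

```python
import re


def prosite_to_regex(pattern: str) -> str:
    """Translate a PROSITE-style pattern into a Python regular expression."""
    regex_parts = []
    for element in pattern.rstrip(".").split("-"):
        # Split off an optional repeat count such as x(4) or x(2,5).
        match = re.fullmatch(r"([^()]+)(?:\((\d+(?:,\d+)?)\))?", element)
        if match is None:
            raise ValueError(f"unsupported pattern element: {element!r}")
        residues, repeat = match.groups()
        if residues == "x":
            part = "."                         # any residue
        elif residues.startswith("{") and residues.endswith("}"):
            part = f"[^{residues[1:-1]}]"      # any residue except those listed
        else:
            part = residues                    # literal residue or [ABC] class
        if repeat:
            part += "{" + repeat + "}"
        regex_parts.append(part)
    return "".join(regex_parts)


# Toy values (assumptions for illustration, not taken from PROSITE).
TOY_PATTERN = "G-x-S-[STAG]-G"
TOY_SEQUENCE = "MKTLLGESAGGAAV"

toy_regex = prosite_to_regex(TOY_PATTERN)
hit = re.search(toy_regex, TOY_SEQUENCE)
if hit:
    print(f"pattern {TOY_PATTERN} matches {hit.group()} at position {hit.start() + 1}")
```

A real analysis would substitute the published pattern for the family of interest and would typically report every match rather than only the first.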

Acyl-CoA thioesterase is another member of the carboxylesterase family (Alexson, S.E. et al.
(1993) Eur. J. Biochem. 214:719-727). Evidence suggests that acyl-CoA thioesterase has a regulatory
role in steroidogenic tissues (Finkielstein, C. et al. (1998) Eur. J. Biochem. 256:60-66).

A phospholipase A2 inhibitor has been identified that has 33% sequence homology with
human leucine-rich alpha-2-glycoprotein (Okumura, K. et al. (1998) J. Biol. Chem. 273:19469-19475).
Leucine-rich repeat (LRR) consensus sequences have also been found in the primary structure of
many proteins, including proteins that participate in biologically important processes, such as
receptors for hormones, enzymes, enzyme inhibitors, proteins for cell adhesion, and ribosome-binding
proteins. All proteins containing LRR domains are thought to be involved in protein-protein
interactions.

The glyoxalase system consists of glyoxalase I, which catalyzes the formation of
S-D-lactoylglutathione from methylglyoxal, a side product of triose-phosphate energy metabolism, and
glyoxalase II, which hydrolyzes S-D-lactoylglutathione to D-lactic acid and reduced glutathione.
Methylglyoxal levels are elevated during hyperglycemia, likely due to increased triose-phosphate
energy metabolism. Elevated levels of glyoxalase II activity have been found in human
non-insulin-dependent diabetes mellitus and in a rat model of this disease. The glyoxalase system has been
implicated in the detoxification of bacterial toxins, and in the control of cell proliferation and
microtubule assembly. Elevated levels of S-D-lactoylglutathione, the substrate of glyoxalase II,
induced growth arrest and toxicity in HL60 cells. Thus, the glyoxalase system, and glyoxalase II in
particular, may be associated with cell proliferation and autoimmune disorders such as diabetes.

The alpha/beta hydrolase protein fold is common to several hydrolases of diverse
phylogenetic origin and catalytic function. Enzymes with the alpha/beta hydrolase fold have a
common core structure consisting of eight beta-strands connected by alpha-helices. The most
conserved structural features of this fold are the loops of the nucleophile-histidine-acid catalytic triad.
The histidine in the catalytic triad is completely conserved, while the nucleophile and acid loops
accommodate more than one type of amino acid (Ollis, D.L. et al. (1992) Protein Eng. 5:197-211).

Sulfatases are members of a highly conserved gene family that share extensive sequence
homology and a high degree of structural similarity. Sulfatases catalyze the cleavage of sulfate esters.
To perform this function, sulfatases undergo a unique post-translational modification in the
endoplasmic reticulum that involves the oxidation of a conserved cysteine residue. A human disorder
called multiple sulfatase deficiency is due to a defect in this post-translational modification step,
leading to inactive sulfatases (Recksiek, M. et al. (1998) J. Biol. Chem. 273:6096-6103).

Phosphohydrolases are enzymes that hydrolyze phosphate esters. Some phosphohydrolases
contain a mutT domain signature sequence. MutT is a protein involved in the GO system responsible
for removing an oxidatively damaged form of guanine from DNA. A region of about 40 amino acid
residues, found in the N-terminus of mutT, is also found in other proteins, including some
phosphohydrolases (PROSITE: PDOC00695 at www.expasy.ch/cgi-bin/get-prodoc-entry on April 27,
1999).

Glycosidases catalyze the cleavage of hemiacetal bonds of glycosides, which are compounds
that contain one or more sugars. Mammalian beta-galactosidase removes the terminal galactose from
gangliosides, glycoproteins, and glycosaminoglycans. Beta-galactosidases belong to family 35 in the
classification of glycosyl hydrolases. Deficiency of this enzyme is associated with the genetic disease
GM1-gangliosidosis, also known as Morquio disease type B (PROSITE: PDOC00910 at
www.expasy.ch/cgi-bin/get-prodoc-entry on May 12, 1999).

Serine hydrolases are a functional class of hydrolytic enzymes that contain a serine residue in
their active site. This class of enzymes contains proteinases, esterases, and lipases which hydrolyze a
variety of substrates and, therefore, have different biological roles. Proteins in this superfamily can be
further grouped into subfamilies based on substrate specificity or amino acid similarities (Puente, X.S.
and C. Lopez-Otin (1995) J. Biol. Chem. 270:12926-12932). One member of the serine hydrolase
superfamily is kraken, a Drosophila gene isolated from a Drosophila embryo cDNA library. Kraken
belongs to a subfamily whose members catalyze cleavage of substrates with a carbonyl-containing
group (Chan, E. et al. (1998) Gene 222:195-201).

The lysozyme c superfamily consists of conventional lysozymes c, calcium-binding
lysozymes c, and alpha-lactalbumin (Prager, E.M. and P. Jolles (1996) EXS 75:9-31). The proteins in this
superfamily have 35-40% sequence homology and share a common three-dimensional fold, but can
have different functions. Lysozymes c are ubiquitous in a variety of tissues and secretions and can
lyse the cell walls of certain bacteria (McKenzie, H.A. (1996) EXS 75:365-409). Alpha-lactalbumin
is a metalloprotein that binds calcium and participates in the synthesis of lactose (Iyer, L.K. and P.K.
Qasba (1999) Protein Eng. 12:129-139). Alpha-lactalbumin occurs in mammalian milk and colostrum
(McKenzie, supra).

WO 00/28045

Lysozymes catalyze the hydrolysis of certain mucopolysaccharides of bacterial cell walls,
specifically, the beta (1-4) glycosidic linkages between N-acetylmuramic acid and
N-acetylglucosamine, and cause bacterial lysis. Lysozymes occur in diverse organisms including
viruses, birds, and mammals. In humans, lysozymes are found in spleen, lung, kidney, white blood
cells, plasma, saliva, milk, tears, and cartilage (Online Mendelian Inheritance in Man (OMIM)
#153450 Lysozyme; Weaver, L.H. et al. (1985) J. Mol. Biol. 184:739-741). Lysozyme c functions in
ruminants as a digestive enzyme, releasing proteins from ingested bacterial cells, and may perform the
same function in human newborns (Braun, O.H. et al. (1995) Klin. Pediatr. 207:4-7).

The two known forms of lysozymes, chicken-type and goose-type, were originally isolated
from chicken and goose egg white, respectively. Chicken-type and goose-type lysozymes have
similar three-dimensional structures, but different amino acid sequences (Nakano, T. and T. Graf
(1991) Biochim. Biophys. Acta 1090:273-276). In chickens, both forms of lysozyme are found in
neutrophil granulocytes (heterophils), but only chicken-type lysozyme is found in egg white.
Generally, chicken-type lysozyme mRNA is found in both adherent monocytes and macrophages and
nonadherent promyelocytes and granulocytes as well as in cells of the bone marrow, spleen, bursa,
and oviduct. Goose-type lysozyme mRNA is found in non-adherent cells of the bone marrow and
lung. Several isozymes have been found in rabbits, including leukocytic, gastrointestinal, and
possibly lymphoepithelial forms (OMIM #153450, supra; Nakano and Graf, supra; and GenBank GI
1310929). A human lysozyme gene encoding a protein similar to chicken-type lysozyme has been
cloned (Yoshimura, K. et al. (1988) Biochem. Biophys. Res. Commun. 150:794-801). A consensus
motif featuring regularly spaced cysteine residues has been derived from the lysozyme C enzymes of
various species (Prosite PS00128, http://expasy.hcuge.ch Swiss Institute of Bioinformatics).
Lysozyme C shares about 40% amino acid sequence identity with alpha-lactalbumin.

Lysozymes have several disease associations. Lysozymuria is observed in diabetic
nephropathy (Shima, M. et al. (1986) Clin. Chem. 32:1818-1822), endemic nephropathy (Bruckner, I.
et al. (1978) Med. Interne. 16:117-125), urinary tract infections (Heidegger, H. (1990) Minerva
Ginecol. 42:243-250), and acute monocytic leukemia (Shaw, M.T. (1978) Am. J. Hematol. 4:97-103).
Nakano (supra) suggested a role for lysozyme in host defense systems. Older rabbits with an
inherited lysozyme deficiency show increased susceptibility to infections, such as subcutaneous
abscesses (OMIM #153450, supra). Human lysozyme gene mutations cause hereditary systemic
amyloidosis, a rare autosomal dominant disease in which amyloid deposits form in the viscera,
including the kidney, adrenal glands, spleen, and liver. This disease is usually fatal by the fifth
decade. The amyloid deposits contain variant forms of lysozyme. Renal amyloidosis is the most
common and potentially the most serious form of organ involvement (Pepys, M.B. et al. (1993)
Nature 362:553-557; OMIM #105200 Familial Visceral Amyloidosis; Cotran, R.S. et al. (1994)
Robbins Pathologic Basis of Disease, W.B. Saunders Company, Philadelphia PA, pp. 231-238).
Increased levels of lysozyme and lactate have been observed in the cerebrospinal fluid of patients
with bacterial meningitis (Ponka, A. et al. (1983) Infection 11:129-131). Acute monocytic leukemia
is characterized by massive lysozymuria (Den Tandt, W.R. (1988) Int. J. Biochem. 20:713-719).

The discovery of new hydrolase proteins and the polynucleotides encoding them satisfies a
need in the art by providing new compositions which are useful in the diagnosis, prevention, and
treatment of cell proliferative, autoimmune/inflammatory, neurological, renal, adrenal, and genetic
disorders.

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, hydrolase proteins, referred to
collectively as "HYDRL" and individually as "HYDRL-1," "HYDRL-2," "HYDRL-3," "HYDRL-4,"
"HYDRL-5," "HYDRL-6," "HYDRL-7," "HYDRL-8," "HYDRL-9," "HYDRL-10," "HYDRL-11,"
"HYDRL-12," "HYDRL-13," "HYDRL-14," "HYDRL-15," and "HYDRL-16." In one aspect, the
invention provides a substantially purified polypeptide comprising an amino acid sequence selected
from the group consisting of SEQ ID NO: 1-16 and fragments thereof. The invention also includes a
polypeptide comprising an amino acid sequence that differs by one or more conservative amino acid
substitutions from an amino acid sequence selected from the group consisting of SEQ ID NO: 1-16.

The invention further provides a substantially purified variant having at least 90% amino acid
identity to at least one of the amino acid sequences selected from the group consisting of SEQ ID
NO: 1-16 and fragments thereof. The invention also provides an isolated and purified polynucleotide
encoding the polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO: 1-16 and fragments thereof. The invention also includes an isolated and purified
polynucleotide variant having at least 90% polynucleotide sequence identity to the polynucleotide
encoding the polypeptide comprising an amino acid sequence selected from the group consisting of
SEQ ID NO: 1-16 and fragments thereof.
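By way of illustration only, a threshold such as "at least 90% identity" can be checked once two sequences have been aligned. The minimal Python sketch below assumes that the two sequences are already aligned to equal length with "-" as the gap character, and that percent identity is computed as identical positions divided by all compared alignment columns; both the convention and the function name `percent_identity` are assumptions made for this example, and other conventions (for example, dividing by the length of the shorter sequence) give different values. The sequences shown are arbitrary toy strings.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over the columns of a pre-computed pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for res_a, res_b in zip(aligned_a.upper(), aligned_b.upper()):
        if res_a == gap and res_b == gap:
            continue                # a column of two gaps carries no information
        compared += 1               # a gap opposite a residue counts as a mismatch
        if res_a == res_b:
            identical += 1
    if compared == 0:
        raise ValueError("no aligned positions to compare")
    return 100.0 * identical / compared


# Toy aligned sequences (assumptions for illustration only).
reference = "MKTAYIAKQR"
candidate = "MKTAYIAKQK"
identity = percent_identity(reference, candidate)
print(f"{identity:.1f}% identity; meets 90% threshold: {identity >= 90.0}")
```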

Additionally, the invention provides an isolated and purified polynucleotide which hybridizes
under stringent conditions to the polynucleotide encoding the polypeptide comprising an amino acid
sequence selected from the group consisting of SEQ ID NO: 1-16 and fragments thereof. The
invention also provides an isolated and purified polynucleotide having a sequence which is
complementary to the polynucleotide encoding the polypeptide comprising the amino acid sequence
selected from the group consisting of SEQ ID NO: 1-16 and fragments thereof.

The invention also provides a method for detecting a polynucleotide in a sample containing
nucleic acids, the method comprising the steps of: (a) hybridizing the complement of the
polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the
hybridization complex correlates with the presence of a polynucleotide in the sample. In one aspect,
the method further comprises amplifying the polynucleotide prior to hybridization.

The invention also provides an isolated and purified polynucleotide comprising a
polynucleotide sequence selected from the group consisting of SEQ ID NO: 17-32 and fragments
thereof. The invention further provides an isolated and purified polynucleotide variant having at least
90% polynucleotide sequence identity to the polynucleotide sequence selected from the group
consisting of SEQ ID NO: 17-32 and fragments thereof. The invention also provides an isolated and
purified polynucleotide having a sequence which is complementary to the polynucleotide comprising
a polynucleotide sequence selected from the group consisting of SEQ ID NO: 17-32 and fragments
thereof.

The invention further provides an expression vector containing at least a fragment of the
polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the group
consisting of SEQ ID NO: 1-16. In another aspect, the expression vector is contained within a host
cell.

The invention also provides a method for producing a polypeptide, the method comprising the
steps of: (a) culturing the host cell containing an expression vector containing a polynucleotide of the
invention under conditions suitable for the expression of the polypeptide; and (b) recovering the
polypeptide from the host cell culture.

The invention also provides a pharmaceutical composition comprising a substantially purified
polypeptide having the amino acid sequence selected from the group consisting of SEQ ID NO: 1-16
and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention further includes a purified antibody which binds to a polypeptide selected from
the group consisting of SEQ ID NO: 1-16 and fragments thereof. The invention also provides a
purified agonist and a purified antagonist to the polypeptide.

The invention also provides a method for treating or preventing a disorder associated with
decreased expression or activity of HYDRL, the method comprising administering to a subject in need
of such treatment an effective amount of a pharmaceutical composition comprising a substantially
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ ID
NO: 1-16 and fragments thereof, in conjunction with a suitable pharmaceutical carrier.

The invention also provides a method for treating or preventing a disorder associated with
increased expression or activity of HYDRL, the method comprising administering to a subject in need
of such treatment an effective amount of an antagonist of a polypeptide having an amino acid
sequence selected from the group consisting of SEQ ID NO: 1-16 and fragments thereof.

BRIEF DESCRIPTION OF THE FIGURE AND TABLES

Figure 1 shows the amino acid sequence alignment between HYDRL-1 (Incyte Clone ID
2293764; SEQ ID NO:1), Colobus guereza lysozyme-c precursor (GI 1790927; SEQ ID NO:33),
Colobus angolensis lysozyme-c precursor (GI 1790967; SEQ ID NO:34), and Nasalis larvatus
lysozyme-c precursor (GI 1790984; SEQ ID NO:35), produced using the multisequence alignment
program of LASERGENE software (DNASTAR, Madison WI).

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs),
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble
full-length sequences encoding HYDRL.

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous
sequences, and methods, algorithms, and searchable databases used for analysis of HYDRL.

Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific
expression patterns of each nucleic acid sequence as determined by northern analysis; diseases,
disorders, or conditions associated with these tissues; and the vector into which each cDNA was
cloned.

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones
encoding HYDRL were isolated.

Table 5 shows the tools, programs, and algorithms used to analyze HYDRL, along with
applicable descriptions, references, and threshold parameters.

DESCRIPTION OF THE INVENTION

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will
be limited only by the appended claims.

It must be noted that as used herein and in the appended claims, the singular forms "a," "an,"
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"HYDRL" refers to the amino acid sequences of substantially purified HYDRL obtained from 
any species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
HYDRL. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of HYDRL either by directly interacting with 
HYDRL or by acting on components of the biological pathway in which HYDRL participates. 

An "allelic variant" is an alternative form of the gene encoding HYDRL. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding HYDRL include those sequences with deletions,
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as HYDRL or
a polypeptide with at least one functional characteristic of HYDRL. Included within this definition
are polymorphisms which may or may not be readily detectable using a particular oligonucleotide
probe of the polynucleotide encoding HYDRL, and improper or unexpected hybridization to allelic
variants, with a locus other than the normal chromosomal locus for the polynucleotide sequence
encoding HYDRL. The encoded protein may also be "altered," and may contain deletions, insertions,
or substitutions of amino acid residues which produce a silent change and result in a functionally
equivalent HYDRL. Deliberate amino acid substitutions may be made on the basis of similarity in
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the
residues, as long as the biological or immunological activity of HYDRL is retained. For example,
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine.
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine,
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine.

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide,
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic
molecules. Where "amino acid sequence" is recited to refer to an amino acid sequence of a naturally
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit the amino
acid sequence to the complete native amino acid sequence associated with the recited protein
molecule.

"Amplification" relates to the production of additional copies of a nucleic acid sequence.
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well
known in the art.

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity
of HYDRL. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small
molecules, or any other compound or composition which modulates the activity of HYDRL either by
directly interacting with HYDRL or by acting on components of the biological pathway in which
HYDRL participates.

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments
thereof, such as Fab, F(ab')2, and Fv fragments, which are capable of binding an epitopic determinant.
Antibodies that bind HYDRL polypeptides can be prepared using intact polypeptides or using
fragments containing small peptides of interest as the immunizing antigen. The polypeptide or
oligopeptide used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the
translation of RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired.
Commonly used carriers that are chemically coupled to peptides include bovine serum albumin,
thyroglobulin, and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize
the animal.

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that
makes contact with a particular antibody. When a protein or a fragment of a protein is used to
immunize a host animal, numerous regions of the protein may induce the production of antibodies
which bind specifically to antigenic determinants (particular regions or three-dimensional structures
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen
used to elicit the immune response) for binding to an antibody.

The term "antisense" refers to any composition containing a nucleic acid sequence which is
complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules may be
produced by any method including synthesis or transcription. Once introduced into a cell, the
complementary nucleotides combine with natural sequences produced by the cell to form duplexes
and to block either transcription or translation. The designation "negative" or "minus" can refer to the
antisense strand, and the designation "positive" or "plus" can refer to the sense strand.

The term "biologically active" refers to a protein having structural, regulatory, or biochemical
functions of a naturally occurring molecule. Likewise, "immunologically active" refers to the
capability of the natural, recombinant, or synthetic HYDRL, or of any oligopeptide thereof, to induce
a specific immune response in appropriate animals or cells and to bind with specific antibodies.

The terms "complementary" and "complementarity" refer to the natural binding of
polynucleotides by base pairing. For example, the sequence "5' A-G-T 3'" bonds to the
complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that
total complementarity exists between the single stranded molecules. The degree of complementarity
between nucleic acid strands has significant effects on the efficiency and strength of the hybridization
between the nucleic acid strands. This is of particular importance in amplification reactions, which
depend upon binding between nucleic acid strands, and in the design and use of peptide nucleic acid
(PNA) molecules.
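The base-pairing rule in the example above, and the distinction between partial and complete complementarity, can be sketched in a few lines of Python. This is an illustrative sketch only: it assumes DNA strands of equal length containing only A, C, G, and T, and the function names `complement`, `reverse_complement`, and `fraction_paired` are hypothetical names introduced here. The first strand is read 5' to 3' and its partner 3' to 5', so that the two are compared position by position.

```python
# Watson-Crick pairing for DNA.
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def complement(strand_5_to_3: str) -> str:
    """Return the complementary strand, written 3' to 5'."""
    return "".join(COMPLEMENT[base] for base in strand_5_to_3.upper())


def reverse_complement(strand_5_to_3: str) -> str:
    """Return the complementary strand, rewritten 5' to 3'."""
    return complement(strand_5_to_3)[::-1]


def fraction_paired(strand_5_to_3: str, partner_3_to_5: str) -> float:
    """Fraction of positions at which the two strands can base-pair."""
    if len(strand_5_to_3) != len(partner_3_to_5):
        raise ValueError("strands must have equal length")
    if not strand_5_to_3:
        raise ValueError("strands must not be empty")
    paired = sum(
        COMPLEMENT[base] == partner
        for base, partner in zip(strand_5_to_3.upper(), partner_3_to_5.upper())
    )
    return paired / len(strand_5_to_3)


print(complement("AGT"))               # the partner of 5' A-G-T 3', written 3' to 5'
print(fraction_paired("AGT", "TCA"))   # complete complementarity
print(fraction_paired("AGT", "TCG"))   # partial complementarity
```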

A "composition comprising a given polynucleotide sequence" and a "composition comprising
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution.
Compositions comprising polynucleotide sequences encoding HYDRL or fragments of HYDRL may
be employed as hybridization probes. The probes may be stored in freeze-dried form and may be
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl
sulfate; SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.).

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to
resolve uncalled bases, extended using the XL-PCR kit (Perkin-Elmer, Norwalk CT) in the 5' and/or
the 3' direction, and resequenced, or which has been assembled from the overlapping sequences of
one or more Incyte Clones and, in some cases, one or more public domain ESTs, using a computer
program for fragment assembly, such as the GELVIEW fragment assembly system (GCG, Madison
WI). Some sequences have been both extended and assembled to produce the consensus sequence.

"Conservative amino acid substitutions" are those substitutions that, when made, least
interfere with the properties of the original protein, i.e., the structure and especially the function of the
protein is conserved and not significantly changed by such substitutions. The table below shows
amino acids which may be substituted for an original amino acid in a protein and which are regarded
as conservative amino acid substitutions.





Original Residue    Conservative Substitution
Ala                 Gly, Ser
Arg                 His, Lys
Asn                 Asp, Gln, His
Asp                 Asn, Glu
Cys                 Ala, Ser
Gln                 Asn, Glu, His
Glu                 Asp, Gln, His
Gly                 Ala
His                 Asn, Arg, Gln, Glu
Ile                 Leu, Val
Leu                 Ile, Val
Lys                 Arg, Gln, Glu
Met                 Leu, Ile
Phe                 His, Met, Leu, Trp, Tyr
Ser                 Cys, Thr
Thr                 Ser, Val
Trp                 Phe, Tyr
Tyr                 His, Phe, Trp
Val                 Ile, Leu, Thr

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain.
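
By way of illustration only, the conservative substitution table may be encoded as a lookup structure. The following Python sketch is not part of the disclosed methods; the names CONSERVATIVE_SUBSTITUTIONS and is_conservative are hypothetical, and the entries use the standard three-letter residue codes (e.g., Gln, Glu, Ile).

```python
# Conservative substitutions keyed by original residue (three-letter codes),
# transcribed from the table in this section. Names here are illustrative only.
CONSERVATIVE_SUBSTITUTIONS = {
    "Ala": {"Gly", "Ser"},
    "Arg": {"His", "Lys"},
    "Asn": {"Asp", "Gln", "His"},
    "Asp": {"Asn", "Glu"},
    "Cys": {"Ala", "Ser"},
    "Gln": {"Asn", "Glu", "His"},
    "Glu": {"Asp", "Gln", "His"},
    "Gly": {"Ala"},
    "His": {"Asn", "Arg", "Gln", "Glu"},
    "Ile": {"Leu", "Val"},
    "Leu": {"Ile", "Val"},
    "Lys": {"Arg", "Gln", "Glu"},
    "Met": {"Leu", "Ile"},
    "Phe": {"His", "Met", "Leu", "Trp", "Tyr"},
    "Ser": {"Cys", "Thr"},
    "Thr": {"Ser", "Val"},
    "Trp": {"Phe", "Tyr"},
    "Tyr": {"His", "Phe", "Trp"},
    "Val": {"Ile", "Leu", "Thr"},
}


def is_conservative(original: str, replacement: str) -> bool:
    """Return True if the table lists `replacement` for `original`."""
    return replacement in CONSERVATIVE_SUBSTITUTIONS.get(original, set())
```

The lookup is directional because the table itself is: Tyr lists His as a conservative substitution, whereas His does not list Tyr.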

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 
polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, hydroxyl, or amino group. A derivative
polynucleotide encodes a polypeptide which retains at least one biological or immunological function
of the natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any
similar process that retains at least one biological or immunological function of the polypeptide from
which it was derived.

A "fragment" is a unique portion of HYDRL or the polynucleotide encoding HYDRL which
is identical in sequence to but shorter in length than the parent sequence. A fragment may comprise
up to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For
example, a fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A
fragment used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least
5, 10, 15, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino
acid residues in length. Fragments may be preferentially selected from certain regions of a molecule.
For example, a polypeptide fragment may comprise a certain length of contiguous amino acids
selected from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a
certain defined sequence. Clearly these lengths are exemplary, and any length that is supported by the
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present
embodiments.
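
As a minimal sketch of the length and contiguity conditions in this definition, the following Python function tests whether one sequence is a contiguous portion of a parent sequence that is shorter than the parent by at least one residue. The function name and the example sequences are hypothetical, and the sketch does not test the further requirement that a fragment be unique to its parent.

```python
def is_fragment(candidate: str, parent: str) -> bool:
    """Return True if `candidate` is a contiguous portion of `parent`
    and is shorter than `parent` by at least one residue."""
    return 0 < len(candidate) < len(parent) and candidate in parent


# Illustrative sequences only; they are not taken from the Sequence Listing.
parent = "ACGTAC"
print(is_fragment("CGTA", parent))    # contiguous and shorter than the parent
print(is_fragment("ACGTAC", parent))  # full length, so not a fragment
```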

A fragment of SEQ ID NO:17-32 comprises a region of unique polynucleotide sequence that
specifically identifies SEQ ID NO:17-32, for example, as distinct from any other sequence in the
same genome. A fragment of SEQ ID NO:17-32 is useful, for example, in hybridization and
amplification technologies and in analogous methods that distinguish SEQ ID NO:17-32 from related
polynucleotide sequences. The precise length of a fragment of SEQ ID NO:17-32 and the region of
SEQ ID NO:17-32 to which the fragment corresponds are routinely determinable by one of ordinary
skill in the art based on the intended purpose for the fragment.

A fragment of SEQ ID NO:1-16 is encoded by a fragment of SEQ ID NO:17-32. A fragment
of SEQ ID NO:1-16 comprises a region of unique amino acid sequence that specifically identifies
SEQ ID NO:1-16. For example, a fragment of SEQ ID NO:1-16 is useful as an immunogenic peptide
for the development of antibodies that specifically recognize SEQ ID NO:1-16. The precise length of
a fragment of SEQ ID NO:1-16 and the region of SEQ ID NO:1-16 to which the fragment
corresponds are routinely determinable by one of ordinary skill in the art based on the intended
purpose for the fragment.

The term "similarity" refers to a degree of complementarity. There may be partial similarity 
or complete similarity. The word "identity" may substitute for the word "similarity." A partially 
complementary sequence that at least partially inhibits an identical sequence from hybridizing to a 
target nucleic acid is referred to as "substantially similar." The inhibition of hybridization of the
completely complementary sequence to the target sequence may be examined using a hybridization
assay (Southern or northern blot, solution hybridization, and the like) under conditions of reduced
stringency. A substantially similar sequence or hybridization probe will compete for and inhibit the
binding of a completely similar (identical) sequence to the target sequence under conditions of
reduced stringency. This is not to say that conditions of reduced stringency are such that non-specific
binding is permitted, as reduced stringency conditions require that the binding of two sequences to
one another be a specific (i.e., a selective) interaction. The absence of non-specific binding may be
tested by the use of a second target sequence which lacks even a partial degree of complementarity
(e.g., less than about 30% similarity or identity). In the absence of non-specific binding, the
substantially similar sequence or probe will not hybridize to the second non-complementary target
sequence.

The phrases "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 

in the sequences being compared in order to optimize alignment between two sequences, and
therefore achieve a more meaningful comparison of the two sequences. 
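
For illustration, percent identity may be computed from two sequences that have already been aligned, with gaps written as "-". The Python sketch below assumes one particular convention, namely matches divided by the number of alignment columns, with gapped columns counted as non-matching. Alignment programs differ in the denominator they use, so this is not presented as the calculation performed by any specific program; the function name is hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumed convention: columns containing a gap in one sequence count as
    non-matching; columns that are gaps in both sequences are skipped.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue  # an all-gap column carries no information
        columns += 1
        if a == b:
            matches += 1
    if columns == 0:
        raise ValueError("alignment has no comparable columns")
    return 100.0 * matches / columns


# Illustrative alignment: 4 matching columns out of 6.
print(round(percent_identity("ACGT-A", "ACCTGA"), 1))
```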

Percent identity between polynucleotide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program. This program is part of the LASERGENE software package, a suite of
molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent
similarity" between aligned polynucleotide sequence pairs.

Alternatively, a suite of commonly used and freely available sequence comparison algorithms
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from
several sources, including the NCBI, Bethesda, MD, and on the Internet at
http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis
programs including "blastn," that is used to align a known polynucleotide sequence with other
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.html. The
"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST
programs are commonly used with gap and other parameters set to default settings. For example, to
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version
2.0.9 (May-07-1999) set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Reward for match: 1
Penalty for mismatch: -2
Open Gap: 5 and Extension Gap: 2 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 11
Filter: on
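
To show how the match, mismatch, and gap parameters listed above combine, the following Python sketch computes a raw score for a pair of pre-aligned nucleotide sequences. It is a simplified illustration and not the BLAST implementation: the function name is hypothetical, and it assumes that a gap of k positions costs the opening penalty plus k times the extension penalty and that adjacent gapped columns form a single gap.

```python
def raw_alignment_score(aligned_a: str, aligned_b: str,
                        reward: int = 1, penalty: int = -2,
                        gap_open: int = 5, gap_extend: int = 2,
                        gap: str = "-") -> int:
    """Sum match rewards, mismatch penalties, and gap costs over an alignment.

    Assumption: a run of k gapped columns costs gap_open + k * gap_extend.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    score = 0
    in_gap = False
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            if not in_gap:
                score -= gap_open  # charged once per run of gapped columns
                in_gap = True
            score -= gap_extend    # charged for every gapped column
        else:
            in_gap = False
            score += reward if a == b else penalty
    return score


# 4 matches (+4), 1 mismatch (-2), one gap of length 1 (-(5 + 2)) gives -5.
print(raw_alignment_score("ACGT-A", "ACCTGA"))
```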

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 

over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 
nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 
in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 
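
The effect of degeneracy can be shown with a short Python sketch. The two nucleotide sequences below are hypothetical examples, not sequences of the invention, and the codon table is deliberately partial: it contains only the six standard-code codons that the example uses.

```python
# Partial standard genetic code: only the codons needed for this example.
CODON_TABLE = {
    "GCT": "Ala", "GCG": "Ala",
    "AAA": "Lys", "AAG": "Lys",
    "GAA": "Glu", "GAG": "Glu",
}


def translate(dna: str) -> list:
    """Translate a DNA string codon by codon using the partial table above."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    return [CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3)]


seq_a = "GCTAAAGAA"  # hypothetical sequence
seq_b = "GCGAAGGAG"  # differs from seq_a at every third codon position

same_peptide = translate(seq_a) == translate(seq_b)
matching_bases = sum(a == b for a, b in zip(seq_a, seq_b))
print(same_peptide, matching_bases, len(seq_a))
```

In this example the two sequences match at only 6 of 9 nucleotide positions, yet both encode Ala-Lys-Glu.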

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to
the percentage of residue matches between at least two polypeptide sequences aligned using a
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 
alignment methods take into account conservative amino acid substitutions. Such conservative 
substitutions, explained in more detail above, generally preserve the hydrophobicity and acidity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e
sequence alignment program (described and referenced above). For pairwise alignments of
polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=1, gap
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default
residue weight table. As with polynucleotide alignments, the percent identity is reported by
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs.

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) with blastp set at default parameters. Such default parameters may be, for example:

Matrix: BLOSUM62
Open Gap: 11 and Extension Gap: 1 penalties
Gap x drop-off: 50
Expect: 10
Word Size: 3
Filter: on

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 

instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 

DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for
stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of identity. Specific 
hybridization complexes form under permissive annealing conditions and remain hybridized after the 
"washing" step(s). The washing step(s) is particularly important in determining the stringency of the 

hybridization process, with more stringent conditions allowing less non-specific binding, i.e., binding
between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1% (w/v)
SDS, and about 100 µg/ml denatured salmon sperm DNA.

Generally, stringency of hybridization is expressed, in part, with reference to the temperature
under which the wash step is carried out. Generally, such wash temperatures are selected to be about
5°C to 20°C lower than the thermal melting point (Tm) for the specific sequence at a defined ionic
strength and pH. The Tm is the temperature (under defined ionic strength and pH) at which 50% of
the target sequence hybridizes to a perfectly matched probe. An equation for calculating Tm and
conditions for nucleic acid hybridization are well known and can be found in Sambrook et al., 1989,
Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Press, Plainview NY;
specifically see volume 2, chapter 9.
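
As a rough illustration of how a wash temperature might be derived from a melting point, the Python sketch below estimates the thermal melting point of a short oligonucleotide with the commonly used Wallace rule of thumb (2°C per A or T, 4°C per G or C) and then applies the 5°C to 20°C offset described above. The Wallace rule is an assumption introduced here for illustration; it is a crude estimate for short oligonucleotides and is not necessarily the equation referred to in the cited manual. The function names and the example oligonucleotide are hypothetical.

```python
def wallace_tm(oligo: str) -> int:
    """Estimate the melting point in degrees C as 2*(A+T) + 4*(G+C).

    Rule-of-thumb estimate for short oligonucleotides only.
    """
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    if not seq or at + gc != len(seq):
        raise ValueError("sequence must be non-empty and contain only A, C, G, T")
    return 2 * at + 4 * gc


def wash_temperature_range(tm: float) -> tuple:
    """Return (lowest, highest) wash temperature: 20 to 5 degrees C below the melting point."""
    return (tm - 20, tm - 5)


tm = wallace_tm("ATGCATGCATGCATGC")  # hypothetical 16-mer
print(tm, wash_temperature_range(tm))
```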

High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration 
may be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance,
denatured salmon sperm DNA at about 100-200 µg/ml. Organic solvent, such as formamide at a
concentration of about 35-50% v/v, may also be used under particular circumstances, such as for
RNA:DNA hybridizations. Useful variations on these wash conditions will be readily apparent to
those of ordinary skill in the art. Hybridization, particularly under high stringency conditions, may be
suggestive of evolutionary similarity between the nucleotides. Such similarity is strongly indicative
of a similar role for the nucleotides and their encoded polypeptides.

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C0t or R0t analysis) or formed between one
nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate
to which cells or their nucleic acids have been fixed).

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate.

The terms "element" and "array element," in a microarray context, refer to hybridizable
polynucleotides arranged on the surface of a substrate.

The term "modulate" refers to a change in the activity of HYDRL. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
biological, functional, or immunological properties of HYDRL.

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material.

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a
functional relationship with the second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Generally, operably linked DNA sequences may be in close proximity or contiguous and, 
where necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, 
and may be pegylated to extend their lifespan in the cell. 

"Probe" refers to nucleic acid sequences encoding HYDRL, their complements, or fragments
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 

polynucleotide by complementary base-pairing. The primer may then be extended along the target
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 

be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100,
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

WO 00/28045 PCT/US99/27009

Methods for preparing and using probes and primers are described in the references, for
example Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd ed., vol. 1-3, Cold
Spring Harbor Press, Plainview NY; Ausubel et al., 1987, Current Protocols in Molecular Biology,
Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis et al., 1990, PCR Protocols, A
Guide to Methods and Applications, Academic Press, San Diego CA. PCR primer pairs can be
derived from a known sequence, for example, by using computer programs intended for that purpose
such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge MA).

Oligonucleotides for use as primers are selected using software known in the art for such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 

5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer

selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of
Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 

selection program (available to the public from the Whitehead Institute/MIT Center for Genome
Research, Cambridge MA) allows the user to input a "mispriming library," in which sequences to 
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 
oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 

PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
selection of primers that hybridize to either the most conserved or least conserved regions of aligned 
nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 

identified by any of the above selection methods are useful in hybridization technologies, for

example, as PCR or sequencing primers, microarray elements, or specific probes to identify fully or 
partially complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide 
selection are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 

that is made by an artificial combination of two or more otherwise separated segments of sequence.
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 
such as those described in Sambrook, supra. The term recombinant includes nucleic acids that have
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to
transform a cell.

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be used to vaccinate a mammal wherein the recombinant nucleic acid is
expressed, inducing a protective immunological response in the mammal. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding HYDRL, or fragments thereof, or HYDRL itself, may comprise a bodily fluid; an 
extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, 
RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc.

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide
containing the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% free, 
preferably about 75% free, and most preferably about 90% free from other components with which
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acids or nucleotides by 
different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers,
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation" describes a process by which exogenous DNA enters and changes a 
recipient cell. Transformation may occur under natural or artificial conditions according to various 
methods well known in the art, and may rely on any known method for the insertion of foreign nucleic
acid sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected
based on the type of host cell being transformed and may include, but is not limited to, viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term "transformed" cells 
includes stably transformed cells in which the inserted DNA is capable of replication either as an
autonomously replicating plasmid or as part of the host chromosome, as well as transiently
transformed cells which express the inserted DNA or RNA for limited periods of time.

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 
the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9
(May-07-1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as, for example, 
an "allelic" (as defined above), "splice," "species," or "polymorphic" variant. A splice variant may 

have significant identity to a reference molecule, but will generally have a greater or lesser number of
polynucleotides due to alternate splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant amino acid identity relative to 

each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide
polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a
propensity for a disease state.
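
By way of example only, single nucleotide differences between a reference sequence and a sample sequence of equal length may be listed as in the Python sketch below. The function name and the two sequences are hypothetical; the sketch assumes the sequences are already aligned without gaps and reports positions counted from 1.

```python
def snp_positions(reference: str, sample: str) -> list:
    """List (position, reference_base, sample_base) for each single-base difference.

    Assumes both sequences are aligned, ungapped, and of equal length.
    Positions are 1-based.
    """
    if len(reference) != len(sample):
        raise ValueError("sequences must have equal length")
    return [
        (index + 1, ref_base, sample_base)
        for index, (ref_base, sample_base) in enumerate(zip(reference, sample))
        if ref_base != sample_base
    ]


# Hypothetical sequences differing at position 5 (A in the reference, T in the sample).
print(snp_positions("ACGTAC", "ACGTTC"))
```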

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having
at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 

identity over a certain defined length of one of the polypeptides.

THE INVENTION 

The invention is based on the discovery of new human hydrolase proteins (HYDRL), the 
polynucleotides encoding HYDRL, and the use of these compositions for the diagnosis, treatment, or 
prevention of cell proliferative, autoimmune/inflammatory, neurological, renal, adrenal, and genetic 
disorders.

Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 
HYDRL. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the 
polypeptide and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte 
clones in which nucleic acids encoding each HYDRL were identified, and column 4 shows the cDNA
libraries from which these clones were isolated. Column 5 shows Incyte clones and their
corresponding cDNA libraries. Clones for which cDNA libraries are not indicated were derived from 
pooled cDNA libraries. The Incyte clones in column 5 were used to assemble the consensus 
nucleotide sequence of each HYDRL and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 
column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical 
methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

As shown in Figure 1, HYDRL-1 has chemical and structural similarity with Colobus guereza
lysozyme-c precursor (GI 1790927; SEQ ID NO:33), Colobus angolensis lysozyme-c precursor (GI
1790967; SEQ ID NO:34) and Nasalis larvatis lysozyme-c precursor (GI 1790984; SEQ ID NO:35).
In particular, HYDRL-1 and Colobus guereza lysozyme-c precursor share 40% identity, HYDRL-1
and Colobus angolensis lysozyme-c precursor share 40% identity, and HYDRL-1 and Nasalis larvatis
lysozyme-c precursor share 41% identity.

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding HYDRL. The first column of Table 3 lists the 
nucleotide SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1. These
fragments are useful, for example, in hybridization or amplification technologies to identify SEQ ID 
NO: 17-32 and to distinguish between SEQ ID NO: 17-32 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 
3 lists tissue categories which express HYDRL as a fraction of total tissues expressing HYDRL. 
Column 4 lists diseases, disorders, or conditions associated with those tissues expressing HYDRL as a 
fraction of total tissues expressing HYDRL. Column 5 lists the vectors used to subclone each cDNA 
library. 

Northern analysis of SEQ ID NO:17 shows the expression of this sequence in tissue
associated with cancer. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding HYDRL were isolated. Column 1 references the nucleotide SEQ
ID NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3
shows the tissue origins and other descriptive information relevant to the cDNA libraries in column 2.

SEQ ID NO:18 maps to chromosome 6 within the interval from 42.30 to 45.40 centiMorgans,
to chromosome 9 within the interval from 130.40 to 166.50 centiMorgans, and to chromosome 16
within the interval from 88.10 to 92.60 centiMorgans.

SEQ ID NO:25 maps to chromosome 1 within the interval from 22.90 to 39.90 centiMorgans 
and to chromosome 3 within the interval from 30.90 to 43.00 centiMorgans. The interval on 
chromosome 3 from 30.90 to 43.00 centiMorgans also contains an EST associated with von Hippel- 
Lindau syndrome. 

SEQ ID NO:28 maps to chromosome 10 within the interval from 137.60 to 139.20 
centiMorgans. 

The invention also encompasses HYDRL variants. A preferred HYDRL variant is one which 
has at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the HYDRL amino acid sequence, and which contains at least one functional or 
structural characteristic of HYDRL. 

The invention also encompasses polynucleotides which encode HYDRL. In a particular 
embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
from the group consisting of SEQ ID NO:17-32, which encodes HYDRL. 

The invention also encompasses a variant of a polynucleotide sequence encoding HYDRL. In 
particular, such a variant polynucleotide sequence will have at least about 80%, or alternatively at 
least about 90%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding HYDRL. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID 
NO:17-32 which has at least about 80%, or alternatively at least about 90%, or even at least about 
95% polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting 
of SEQ ID NO:17-32. Any one of the polynucleotide variants described above can encode an amino 
acid sequence which contains at least one functional or structural characteristic of HYDRL. 
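
The identity thresholds recited above (about 80%, 90%, or 95%) can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned by some external program and that gap characters are written as "-". Skipping gap columns is only one of several conventions for computing percent identity, and the function names are hypothetical, not part of the disclosure.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns in which neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # assumption: gap columns are excluded from the count
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def meets_threshold(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if the pair reaches the given percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

For example, two aligned 10-residue sequences that differ at a single position would satisfy the 80% and 90% thresholds but not the 95% threshold.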

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding HYDRL, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring HYDRL, and all such variations are to be considered 
as being specifically disclosed. 
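
The scale of the degeneracy described above can be sketched in Python as follows. The sketch builds the standard triplet genetic code, groups synonymous codons by amino acid, and counts (or, for short peptides, enumerates) the coding sequences that specify a given peptide. The function names and the example peptides are illustrative assumptions only.

```python
from itertools import product
from math import prod

# Standard genetic code, codons ordered with bases T, C, A, G ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Group synonymous codons under the amino acid they encode.
SYNONYMS = {}
for codon, aa in CODON_TABLE.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def count_encodings(peptide: str) -> int:
    """Number of distinct coding sequences that translate to the peptide."""
    return prod(len(SYNONYMS[aa]) for aa in peptide.upper())


def enumerate_encodings(peptide: str) -> list:
    """All coding sequences for a (short) peptide; grows very quickly with length."""
    choices = (SYNONYMS[aa] for aa in peptide.upper())
    return ["".join(codons) for codons in product(*choices)]
```

A three-residue peptide such as Met-Leu-Ser already has 1 x 6 x 6 = 36 possible coding sequences, which is why enumeration is practical only for short fragments while counting remains cheap for a full-length protein.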

Although nucleotide sequences which encode HYDRL and its variants are generally capable 
of hybridizing to the nucleotide sequence of the naturally occurring HYDRL under appropriately 
selected conditions of stringency, it may be advantageous to produce nucleotide sequences encoding 
HYDRL or its derivatives possessing a substantially different codon usage, e.g., inclusion of 
non-naturally occurring codons. Codons may be selected to increase the rate at which expression of the 
peptide occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with 
which particular codons are utilized by the host. Other reasons for substantially altering the 
nucleotide sequence encoding HYDRL and its derivatives without altering the encoded amino acid 
sequences include the production of RNA transcripts having more desirable properties, such as a 
greater half-life, than transcripts produced from the naturally occurring sequence. 
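
Selecting codons according to host usage frequency, without altering the encoded amino acid sequence, can be sketched in Python as shown below. The usage table is a hypothetical placeholder, not measured data for any organism; a real table for the chosen host would be substituted. Codons whose amino acid has no entry in the table are left unchanged.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a coding sequence whose length is a multiple of three."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna: str, usage: dict) -> str:
    """Replace each codon with the most frequently used synonymous codon in `usage`."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of 3")
    best = {}  # amino acid -> preferred codon according to the usage table
    for codon, freq in usage.items():
        aa = CODON_TABLE[codon]
        if aa not in best or freq > usage[best[aa]]:
            best[aa] = codon
    recoded = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        recoded.append(best.get(CODON_TABLE[codon], codon))
    return "".join(recoded)


# Hypothetical relative frequencies, for illustration only.
HYPOTHETICAL_USAGE = {"CTG": 0.50, "CTA": 0.04, "TTA": 0.13, "AAA": 0.74, "AAG": 0.26}
```

Comparing translate() of the original and recoded sequences confirms that only the nucleotide sequence, and not the encoded polypeptide, has changed.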

The invention also encompasses production of DNA sequences which encode HYDRL and 
HYDRL derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 
using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding HYDRL or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:17-32 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
152:507-511.) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions." 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Perkin- 
Elmer), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway NJ), or 
combinations of polymerases and proofreading exonucleases such as those found in the ELONGASE 
amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence preparation is 
automated with machines such as the MICROLAB 2200 liquid transfer system (Hamilton, Reno NV), 
PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 800 thermal cycler 
(Perkin-Elmer). Sequencing is then carried out using either the ABI 373 or 377 DNA sequencing 
system (Perkin-Elmer), the MEGABACE 1000 DNA sequencing system (Molecular Dynamics, 
Sunnyvale CA), or other systems known in the art. The resulting sequences are analyzed using a 
variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) Short 
Protocols in Molecular Biology, John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. (1995) 
Molecular Biology and Biotechnology, Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding HYDRL may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72°C. 
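
The primer criteria stated above (about 22 to 30 nucleotides, GC content of about 50% or more, annealing at about 68°C to 72°C) can be checked with a short Python sketch. The melting temperature is estimated with a widely used rule-of-thumb formula, Tm = 64.9 + 41 x (G + C - 16.4) / N, which is an assumption made here for illustration; it is not the calculation performed by OLIGO 4.06 or any other commercial program, and the function names are hypothetical.

```python
def gc_fraction(primer: str) -> float:
    """Fraction of G and C bases in the primer."""
    p = primer.upper()
    if not p:
        raise ValueError("primer must not be empty")
    return (p.count("G") + p.count("C")) / len(p)


def approx_tm(primer: str) -> float:
    """Rough melting temperature in degrees C (rule-of-thumb approximation only)."""
    p = primer.upper()
    if not p:
        raise ValueError("primer must not be empty")
    gc = p.count("G") + p.count("C")
    return 64.9 + 41.0 * (gc - 16.4) / len(p)


def check_primer(primer: str, min_len: int = 22, max_len: int = 30,
                 min_gc: float = 0.50, tm_range: tuple = (68.0, 72.0)) -> dict:
    """Report which of the three design criteria the primer satisfies."""
    tm = approx_tm(primer)
    return {
        "length_ok": min_len <= len(primer) <= max_len,
        "gc_ok": gc_fraction(primer) >= min_gc,
        "tm_ok": tm_range[0] <= tm <= tm_range[1],
    }
```

A candidate that fails any one check would normally be shifted along the template or lengthened and then re-evaluated.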

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different 
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Perkin-Elmer), and the entire process 
from loading of samples to computer analysis and electronic data display may be computer controlled. 
Capillary electrophoresis is especially preferable for sequencing small DNA fragments which may be 
present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode HYDRL may be cloned in recombinant DNA molecules that direct expression of 
HYDRL, or fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent 
degeneracy of the genetic code, other DNA sequences which encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and used to express HYDRL. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter HYDRL-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

In another embodiment, sequences encoding HYDRL may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 
Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 
Alternatively, HYDRL itself or a fragment thereof may be synthesized using chemical methods. For 
example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using the 
ABI 431A peptide synthesizer (Perkin-Elmer). Additionally, the amino acid sequence of HYDRL, or 
any part thereof, may be altered during direct synthesis and/or combined with sequences from other 
proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g., Chicz, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, T. (1984) Proteins, Structures and Molecular Properties, WH Freeman, New 
York NY.) 

In order to express a biologically active HYDRL, the nucleotide sequences encoding HYDRL 
or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which 
contains the necessary elements for transcriptional and translational control of the inserted coding 
sequence in a suitable host. These elements include regulatory sequences, such as enhancers, 
constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector and in 
polynucleotide sequences encoding HYDRL. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding HYDRL. Such signals include the ATG initiation codon and adjacent sequences, 
e.g. the Kozak sequence. In cases where sequences encoding HYDRL and its initiation codon and 
upstream regulatory sequences are inserted into the appropriate expression vector, no additional 
transcriptional or translational control signals may be needed. However, in cases where only coding 
sequence, or a fragment thereof, is inserted, exogenous translational control signals including an 
in-frame ATG initiation codon should be provided by the vector. Exogenous translational elements and 
initiation codons may be of various origins, both natural and synthetic. The efficiency of expression 
may be enhanced by the inclusion of enhancers appropriate for the particular host cell system used. 
(See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 
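
The role of the ATG initiation codon, its adjacent Kozak context, and the reading frame can be illustrated with a minimal Python sketch. It assumes the commonly cited simplification that the two most influential flanking positions are a purine at position -3 and a G at position +4, with the A of the ATG counted as +1. The labels "strong", "adequate", and "weak" and the function names are illustrative conventions, not terms taken from this disclosure.

```python
def kozak_context(seq: str, atg_index: int) -> str:
    """Classify the context of the ATG that starts at atg_index (0-based)."""
    seq = seq.upper()
    if seq[atg_index:atg_index + 3] != "ATG":
        raise ValueError("no ATG at the given index")
    # Position -3 is three bases upstream of the A; position +4 follows the G.
    minus3 = seq[atg_index - 3] if atg_index >= 3 else ""
    plus4 = seq[atg_index + 3] if atg_index + 3 < len(seq) else ""
    hits = int(minus3 in ("A", "G")) + int(plus4 == "G")
    return ("weak", "adequate", "strong")[hits]


def in_frame(atg_index: int, cds_start: int) -> bool:
    """True if a vector-supplied ATG is in the same reading frame as the insert."""
    return (cds_start - atg_index) % 3 == 0
```

When only a coding fragment is inserted, a check such as in_frame() expresses the requirement that the vector's ATG and the first codon of the insert be separated by a whole number of codons.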

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding HYDRL and appropriate transcriptional and translational 
control elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, 
and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et 
al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, New York NY, ch. 9, 13, and 
16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding HYDRL. These include, but are not limited to, microorganisms such as bacteria 
transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors; insect cell systems infected with viral expression vectors 
(e.g., baculovirus); plant cell systems transformed with viral expression vectors (e.g., cauliflower 
mosaic virus, CaMV, or tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding HYDRL. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding HYDRL can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding HYDRL into the vector's multiple 
cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 
transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. 
Chem. 264:5503-5509.) When large quantities of HYDRL are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of HYDRL may be used. For example, vectors 
containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of HYDRL. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomyces cerevisiae or Pichia pastoris. In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 
integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

Plant systems may also be used for expression of HYDRL. Transcription of sequences 
encoding HYDRL may be driven by viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 
6:307-311). Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock 
promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. 
(1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105.) 
These constructs can be introduced into plant cells by direct DNA transformation or 
pathogen-mediated transfection. (See, e.g., The McGraw Hill Yearbook of Science and Technology 
(1992) McGraw Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding HYDRL may be ligated into 
an adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential E1 or E3 region of the viral genome may be used to obtain 
infective virus which expresses HYDRL in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino 
polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
HYDRL in cell lines is preferred. For example, sequences encoding HYDRL can be transformed into 
cell lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media 
before being switched to selective media. The purpose of the selectable marker is to confer resistance 
to a selective agent, and its presence allows growth and recovery of cells which successfully express 
the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk- and apr- cells, respectively. (See, e.g., Wigler, M. et 
al. (1977) Cell 11:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, 
or herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8053.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), β-glucuronidase and its substrate β-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding HYDRL is inserted within a marker gene sequence, transformed cells containing 
sequences encoding HYDRL can be identified by the absence of marker gene function. Alternatively, 
a marker gene can be placed in tandem with a sequence encoding HYDRL under the control of a 
single promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding HYDRL and that 
express HYDRL may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 
chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of HYDRL using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 
fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on HYDRL is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual, APS Press, St. Paul MN, 
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology, Greene Pub. Associates and 
Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana 
Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding HYDRL 
include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding HYDRL, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Promega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding HYDRL may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode HYDRL may be designed to contain signal sequences 
which direct secretion of HYDRL through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
American Type Culture Collection (ATCC, Manassas VA) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding HYDRL may be ligated to a heterologous sequence resulting in translation of a 
fusion protein in any of the aforementioned host systems. For example, a chimeric HYDRL protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of HYDRL activity. Heterologous protein 
and peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate 
fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and 
metal-chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the HYDRL encoding sequence and the heterologous 
protein sequence, so that HYDRL may be cleaved away from the heterologous moiety following 
purification. Methods for fusion protein expression and purification are discussed in Ausubel (1995, 
supra, ch. 10). A variety of commercially available kits may also be used to facilitate expression and 
purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled HYDRL may be achieved 
in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
precursor, for example, 35S-methionine. 

Fragments of HYDRL may be produced not only by recombinant means, but also by direct 
peptide synthesis using solid-phase techniques. (See, e.g., Creighton, supra, pp. 55-60.) Protein 
synthesis may be performed by manual techniques or by automation. Automated synthesis may be 
achieved, for example, using the ABI 431A peptide synthesizer (Perkin-Elmer). Various fragments of 
HYDRL may be synthesized separately and then combined to produce the full length molecule. 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists between 
regions of HYDRL and hydrolase proteins. In addition, the expression of HYDRL is closely 
associated with proliferating tissues, inflamed tissues, neurological tissues, and cancer. In some 
cases, sequences encoding HYDRL map to chromosomal regions associated with inherited diseases. 
Therefore, HYDRL appears to play a role in cell proliferative, autoimmune/inflammatory, 
neurological, renal, adrenal, and genetic disorders. In the treatment of disorders associated with 
increased HYDRL expression or activity, it is desirable to decrease the expression or activity of 
HYDRL. In the treatment of disorders associated with decreased HYDRL expression or activity, it is 
desirable to increase the expression or activity of HYDRL. 

Therefore, in one embodiment, HYDRL or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HYDRL. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia, and a cancer including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, a cancer of 
the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, 
salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an autoimmune/inflammatory 
disorder, such as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune 
thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), 
bronchitis, bursitis, cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 
syndrome, gout, Graves' disease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, 
hepatitis, hypereosinophilia, irritable bowel syndrome, episodic lymphopenia with lymphocytotoxins, 
mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, myocardial or 
pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, polycythemia vera, 
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary thrombocythemia, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation; viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections; trauma, and hematopoietic cancer including lymphoma, leukemia, and 
myeloma; a neurological disorder, such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral 
neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease 
and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron disorders, 
progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and 
other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural empyema, 
epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central 
nervous system disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome; fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central
nervous system, cerebral palsy, a neuroskeletal disorder, an autonomic nervous system disorder, a
cranial nerve disorder, a spinal cord disease, muscular dystrophy and other neuromuscular disorders, a
peripheral nervous system disorder, dermatomyositis and polymyositis; inherited, metabolic,
endocrine, and toxic myopathy; myasthenia gravis, periodic paralysis; a mental disorder including
mood, anxiety, and schizophrenic disorders; seasonal affective disorder (SAD); akathisia, amnesia,
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder; a renal disorder such as renal amyloidosis, hypertension; primary 
aldosteronism; Addison's disease; renal failure; glomerulonephritis; chronic glomerulonephritis; 
tubulointerstitial nephritis; cystic disorders of the kidney and dysplastic malformations such as 
polycystic disease, renal dysplasias, and cortical or medullary cysts; inherited polycystic renal 
diseases (PRD), such as recessive and autosomal dominant PRD; medullary cystic disease; medullary 
sponge kidney and tubular dysplasia; Alport's syndrome; non-renal cancers which affect renal 
physiology, such as bronchogenic tumors of the lungs or tumors of the basal region of the brain; 
multiple myeloma; adenocarcinomas of the kidney; and metastatic renal carcinoma; an adrenal 
disorder such as angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's 
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, and pheochromocytoma; a 
genetic disorder, such as GM1-gangliosidosis, Niemann-Pick disease, adrenoleukodystrophy, Alport's
syndrome, choroideremia, Duchenne and Becker muscular dystrophy, Down's syndrome, cystic
fibrosis, chronic granulomatous disease, Gaucher's disease, Huntington's chorea, Marfan's syndrome,
muscular dystrophy, myotonic dystrophy, pycnodysostosis, Refsum's syndrome, retinoblastoma, 
sickle cell anemia, thalassemia, Werner syndrome, von Willebrand's disease, von Hippel-Lindau 
syndrome, Wilms' tumor, Zellweger syndrome, peroxisomal acyl-CoA oxidase deficiency, 
peroxisomal thiolase deficiency, peroxisomal bifunctional protein deficiency, mitochondrial carnitine 
palmitoyl transferase and carnitine deficiency, mitochondrial very-long-chain acyl-CoA 
dehydrogenase deficiency, mitochondrial medium-chain acyl-CoA dehydrogenase deficiency, 
mitochondrial short-chain acyl-CoA dehydrogenase deficiency, mitochondrial electron transport 
flavoprotein and electron transport flavoprotein:ubiquinone oxidoreductase deficiency, mitochondrial
trifunctional protein deficiency, and mitochondrial short-chain 3-hydroxyacyl-CoA dehydrogenase
deficiency. 

In another embodiment, a vector capable of expressing HYDRL or a fragment or derivative 
thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of HYDRL including, but not limited to, those described above.

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
HYDRL in conjunction with a suitable pharmaceutical carrier may be administered to a subject to 
treat or prevent a disorder associated with decreased expression or activity of HYDRL including, but 
not limited to, those provided above.

In still another embodiment, an agonist which modulates the activity of HYDRL may be
administered to a subject to treat or prevent a disorder associated with decreased expression or
activity of HYDRL including, but not limited to, those listed above. 

In a further embodiment, an antagonist of HYDRL may be administered to a subject to treat 
or prevent a disorder associated with increased expression or activity of HYDRL. Examples of such 
disorders include, but are not limited to, those cell proliferative, autoimmune/inflammatory,
neurological, renal, adrenal, and genetic disorders described above. In one aspect, an antibody which
specifically binds HYDRL may be used directly as an antagonist or indirectly as a targeting or 
delivery mechanism for bringing a pharmaceutical agent to cells or tissues which express HYDRL.

In an additional embodiment, a vector expressing the complement of the polynucleotide
encoding HYDRL may be administered to a subject to treat or prevent a disorder associated with
increased expression or activity of HYDRL including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. The
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of HYDRL may be produced using methods which are generally known in the 
art. In particular, purified HYDRL may be used to produce antibodies or to screen libraries of
pharmaceutical agents to identify those which specifically bind HYDRL. Antibodies to HYDRL may
also be generated using methods that are well known in the art. Such antibodies may include, but are 
not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use.

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with HYDRL or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols,
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans,
BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are especially preferable.

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to
HYDRL have an amino acid sequence consisting of at least about 5 amino acids, and generally will
consist of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or
fragments are identical to a portion of the amino acid sequence of the natural protein and contain the
entire amino acid sequence of a small, naturally occurring molecule. Short stretches of HYDRL
amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric
molecule may be produced.

Monoclonal antibodies to HYDRL may be prepared using any technique which provides for 
the production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J.
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.)

In addition, techniques developed for the production of "chimeric antibodies" such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc.
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda,
S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single
chain antibodies may be adapted, using methods known in the art, to produce HYDRL-specific single
chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g.,
Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134-10137.)

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.)

WO 00/28045 PCT/US99/27009

Antibody fragments which contain specific binding sites for HYDRL may also be generated. 
For example, such fragments include, but are not limited to, F(ab')2 fragments produced by pepsin
digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of
the F(ab')2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D.
et al. (1989) Science 246:1275-1281.)

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such
immunoassays typically involve the measurement of complex formation between HYDRL and its
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies
reactive to two non-interfering HYDRL epitopes is generally used, but a competitive binding assay
may also be employed (Pound, supra).

Various methods such as Scatchard analysis in conjunction with radioimmunoassay
techniques may be used to assess the affinity of antibodies for HYDRL. Affinity is expressed as an
association constant, Ka, which is defined as the molar concentration of HYDRL-antibody complex
divided by the product of the molar concentrations of free antigen and free antibody under equilibrium
conditions. The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in
their affinities for multiple HYDRL epitopes, represents the average affinity, or avidity, of the antibodies
for HYDRL. The Ka determined for a preparation of monoclonal antibodies, which are monospecific
for a particular HYDRL epitope, represents a true measure of affinity. High-affinity antibody
preparations with Ka ranging from about 10^9 to 10^12 L/mole are preferred for use in immunoassays in
which the HYDRL-antibody complex must withstand rigorous manipulations. Low-affinity antibody
preparations with Ka ranging from about 10^6 to 10^7 L/mole are preferred for use in
immunopurification and similar procedures which ultimately require dissociation of HYDRL,
preferably in active form, from the antibody (Catty, D. (1988) Antibodies, Volume I: A Practical
Approach, IRL Press, Washington, DC; Liddell, J.E. and Cryer, A. (1991) A Practical Guide to
Monoclonal Antibodies, John Wiley & Sons, New York NY).
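
As a worked illustration of the definition of the association constant, the following Python sketch computes it from equilibrium molar concentrations and sorts the result into the approximate ranges named in the text. The function names and the example concentrations are hypothetical and chosen only for illustration; they are not measurements from this disclosure.

```python
def association_constant(complex_molar, free_antigen_molar, free_antibody_molar):
    """Ka = [complex] / ([free antigen] * [free antibody]), in L/mole."""
    if free_antigen_molar <= 0 or free_antibody_molar <= 0:
        raise ValueError("free concentrations must be positive")
    return complex_molar / (free_antigen_molar * free_antibody_molar)


def preferred_use(ka):
    """Map a Ka value onto the approximate ranges given in the text."""
    if 1e9 <= ka <= 1e12:
        return "immunoassay"
    if 1e6 <= ka <= 1e7:
        return "immunopurification"
    return "outside the ranges named in the text"


# Hypothetical equilibrium concentrations (mol/L), for illustration only.
ka = association_constant(5e-9, 1e-9, 1e-9)
print(f"Ka = {ka:.1e} L/mole -> {preferred_use(ka)}")
```

The sketch assumes a single class of binding sites; for a polyclonal preparation the value obtained this way would correspond to the average affinity described above.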

The titer and avidity of polyclonal antibody preparations may be further evaluated to
determine the quality and suitability of such preparations for certain downstream applications. For
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably
5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation of
HYDRL-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and
guidelines for antibody quality and usage in various applications, are generally available. (See, e.g.,
Catty, supra, and Coligan et al. supra.)

In another embodiment of the invention, the polynucleotides encoding HYDRL, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 
complement of the polynucleotide encoding HYDRL may be used in situations in which it would be 
desirable to block the transcription of the mRNA. In particular, cells may be transformed with
sequences complementary to polynucleotides encoding HYDRL. Thus, complementary molecules or 
fragments may be used to modulate HYDRL activity, or to achieve regulation of gene function. Such 
technology is now well known in the art, and sense or antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
HYDRL. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, or 
from various bacterial plasmids, may be used for delivery of nucleotide sequences to the targeted 
organ, tissue, or cell population. Methods which are well known to those skilled in the art can be used 
to construct vectors to express nucleic acid sequences complementary to the polynucleotides encoding 
HYDRL. (See, e.g., Sambrook, supra; Ausubel, 1995, supra.)

Genes encoding HYDRL can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of a polynucleotide, or fragment thereof, encoding HYDRL. Such 
constructs may be used to introduce untranslatable sense or antisense sequences into a cell. Even in 
the absence of integration into the DNA, such vectors may continue to transcribe RNA molecules 
until they are disabled by endogenous nucleases. Transient expression may last for a month or more 
with a non-replicating vector, and may last even longer if appropriate replication elements are part of 
the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 
regulatory regions of the gene encoding HYDRL. Oligonucleotides derived from the transcription 
initiation site, e.g., between about positions -10 and +10 from the start site, may be employed. 
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix pairing 
is useful because it causes inhibition of the ability of the double helix to open sufficiently for the 
binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic advances 
using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, 
B.E. and B.I. Carr, Molecular and Immunologic Approaches, Futura Publishing, Mt. Kisco NY, pp.
163-177.) A complementary sequence or antisense molecule may also be designed to block 
translation of mRNA by preventing the transcript from binding to ribosomes. 
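
The design of an oligonucleotide complementary to the region between about positions -10 and +10 can be sketched as follows. This is a minimal illustration only: the function name, the 0-based indexing convention, and the example sequence are assumptions made here and are not part of the disclosure.

```python
# Watson-Crick complement table for DNA.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def antisense_oligo(sense_dna, start_index, upstream=10, downstream=10):
    """Return the reverse complement of the window spanning positions
    -upstream..+downstream around the transcription start site.

    start_index is the 0-based index of position +1 in sense_dna
    (hypothetical convention; there is no position 0).
    """
    lo = start_index - upstream
    hi = start_index + downstream
    if lo < 0 or hi > len(sense_dna):
        raise ValueError("window extends past the supplied sequence")
    window = sense_dna[lo:hi].upper()
    return window.translate(COMPLEMENT)[::-1]


# Hypothetical sense strand: ten A's upstream, ten C's from position +1.
example = "A" * 10 + "C" * 10
print(antisense_oligo(example, start_index=10))
```

The returned 20-mer is written 5' to 3' and pairs with the sense strand across the chosen window; whether such an oligonucleotide is effective would still have to be determined experimentally.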

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding HYDRL.

Specific ribozyme cleavage sites within any potential RNA target are initially identified by
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA,
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary
oligonucleotides using ribonuclease protection assays. 
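
The initial scanning step described above can be sketched in Python as follows. The function names, the centring rule for the window, and the example RNA are hypothetical; the sketch only locates GUA, GUU, and GUC triplets and extracts a surrounding window of 15 to 20 ribonucleotides, and does not attempt the secondary-structure or accessibility evaluation.

```python
import re


def find_cleavage_sites(rna):
    """Return (index, triplet) for every GUA, GUU, or GUC in the RNA."""
    return [(m.start(), m.group()) for m in re.finditer(r"GU[AUC]", rna.upper())]


def candidate_window(rna, site_index, length=17):
    """Return a window of `length` nt (15 to 20) containing the triplet
    that starts at site_index, roughly centred on it."""
    if not 15 <= length <= 20:
        raise ValueError("length must be between 15 and 20")
    centre = site_index + 1
    lo = max(0, centre - length // 2)
    hi = min(len(rna), lo + length)
    lo = max(0, hi - length)  # shift left if the window hit the 3' end
    return rna[lo:hi]


# Hypothetical target RNA, for illustration only.
target = "AAGUCAAGUAAAGUGAAGUU"
for index, triplet in find_cleavage_sites(target):
    print(index, triplet, candidate_window(target, index, length=15))
```

Each window returned here is only a candidate; the text goes on to describe how candidates are screened for structure and hybridization accessibility.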

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA
sequences encoding HYDRL. Such DNA sequences may be incorporated into a wide variety of
vectors with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA
constructs that synthesize complementary RNA, constitutively or inducibly, can be introduced into
cell lines, cells, or tissues.

RNA molecules may be modified to increase intracellular stability and half-life. Possible
modifications include, but are not limited to, the addition of flanking sequences at the 5' and/or 3' ends
of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiester linkages
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queuosine,
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine,
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases.

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells
taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.)

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys.

An additional embodiment of the invention relates to the administration of a pharmaceutical 
or sterile composition, in conjunction with a pharmaceutically acceptable carrier, for any of the
therapeutic effects discussed above. Such pharmaceutical compositions may consist of HYDRL,
antibodies to HYDRL, and mimetics, agonists, antagonists, or inhibitors of HYDRL. The
compositions may be administered alone or in combination with at least one other agent, such as a
stabilizing compound, which may be administered in any sterile, biocompatible pharmaceutical carrier 
including, but not limited to, saline, buffered saline, dextrose, and water. The compositions may be 
administered to a patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial,
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may contain suitable
pharmaceutically acceptable carriers comprising excipients and auxiliaries which facilitate processing
of the active compounds into preparations which can be used pharmaceutically. Further details on
techniques for formulation and administration may be found in the latest edition of Remington's
Pharmaceutical Sciences (Mack Publishing, Easton PA).

Pharmaceutical compositions for oral administration can be formulated using
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration.
Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, 
capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active
compounds with solid excipient and processing the resultant mixture of granules (optionally, after
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, mannitol,
and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as methyl cellulose,
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including arabic and
tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating or solubilizing agents
may be added, such as the cross-linked polyvinyl pyrrolidone, agar, and alginic acid or a salt thereof,
such as sodium alginate.

Dragee cores may be used in conjunction with suitable coatings, such as concentrated sugar 
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, polyethylene 
glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. 
Dyestuffs or pigments may be added to the tablets or dragee coatings for product identification or to
characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft capsules, 
the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid, or 
liquid polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be formulated in 
aqueous solutions, preferably in physiologically compatible buffers such as Hanks' solution, Ringer's 
solution, or physiologically buffered saline. Aqueous injection suspensions may contain substances 
which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Additionally, suspensions of the active compounds may be prepared as appropriate oily 
injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils, such as sesame oil, or 
synthetic fatty acid esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic 
amino polymers may also be used for delivery. Optionally, the suspension may also contain suitable 
stabilizers or agents to increase the solubility of the compounds and allow for the preparation of 
highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a manner 
that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with many 
acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and succinic 
acids. Salts tend to be more soluble in aqueous or other protonic solvents than are the corresponding 
free base forms. In other cases, the preparation may be a lyophilized powder which may contain any 
or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a
pH range of 4.5 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an appropriate 
container and labeled for treatment of an indicated condition. For administration of HYDRL, such 
labeling would include amount, frequency, and method of administration. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art.

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, or pigs. 
An animal model may also be used to determine the appropriate concentration range and route of 
administration. Such information can then be used to determine useful doses and routes for
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example 
HYDRL or fragments thereof, antibodies of HYDRL, and agonists, antagonists or inhibitors of 
HYDRL, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, such
as by calculating the ED50 (the dose therapeutically effective in 50% of the population) or LD50 (the
dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the
therapeutic index, which can be expressed as the LD50/ED50 ratio. Pharmaceutical compositions
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and
animal studies are used to formulate a range of dosage for human use. The dosage contained in such
compositions is preferably within a range of circulating concentrations that includes the ED50 with
little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 
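
The therapeutic index defined above is a simple ratio, and the stated preference for large indices can be illustrated with a short Python sketch. The compound names and dose values below are hypothetical and are included only to show the arithmetic.

```python
def therapeutic_index(ld50, ed50):
    """Therapeutic index as the LD50/ED50 ratio (both doses in the same units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical (LD50, ED50) pairs in mg/kg, for illustration only.
candidates = {
    "compound_A": (400.0, 10.0),
    "compound_B": (150.0, 30.0),
}

# Rank candidates from largest to smallest therapeutic index.
ranked = sorted(
    candidates,
    key=lambda name: therapeutic_index(*candidates[name]),
    reverse=True,
)
for name in ranked:
    print(name, therapeutic_index(*candidates[name]))
```

Under these assumed values compound_A would be preferred, since its ratio of lethal to effective dose is larger.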

The exact dosage will be determined by the practitioner, in light of factors related to the
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s), reaction sensitivities, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4
days, every week, or biweekly depending on the half-life and clearance rate of the particular
formulation. 

Normal dosage amounts may vary from about 0.1 µg to 100,000 µg, up to a total dose of
about 1 gram, depending upon the route of administration. Guidance as to particular dosages and
methods of delivery is provided in the literature and generally available to practitioners in the art.
Those skilled in the art will employ different formulations for nucleotides than for proteins or their
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc.

DIAGNOSTICS

In another embodiment, antibodies which specifically bind HYDRL may be used for the
diagnosis of disorders characterized by expression of HYDRL, or in assays to monitor patients being
treated with HYDRL or agonists, antagonists, or inhibitors of HYDRL. Antibodies useful for
diagnostic purposes may be prepared in the same manner as described above for therapeutics.
Diagnostic assays for HYDRL include methods which utilize the antibody and a label to detect
HYDRL in human body fluids or in extracts of cells or tissues. The antibodies may be used with or
without modification, and may be labeled by covalent or non-covalent attachment of a reporter
molecule. A wide variety of reporter molecules, several of which are described above, are known in
the art and may be used.

A variety of protocols for measuring HYDRL, including ELISAs, RIAs, and FACS, are
known in the art and provide a basis for diagnosing altered or abnormal levels of HYDRL expression.
Normal or standard values for HYDRL expression are established by combining body fluids or cell
extracts taken from normal mammalian subjects, for example, human subjects, with antibody to
HYDRL under conditions suitable for complex formation. The amount of standard complex
formation may be quantitated by various methods, such as photometric means. Quantities of HYDRL
expressed in subject, control, and disease samples from biopsied tissues are compared with the 
standard values. Deviation between standard and subject values establishes the parameters for 
diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HYDRL may be used
for diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences,
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect
and quantify gene expression in biopsied tissues in which expression of HYDRL may be correlated
with disease. The diagnostic assay may be used to determine absence, presence, and excess
expression of HYDRL, and to monitor regulation of HYDRL levels during therapeutic intervention.

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding HYDRL or closely related molecules may be used 
to identify nucleic acid sequences which encode HYDRL. The specificity of the probe, whether it is 
made from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a
conserved motif, and the stringency of the hybridization or amplification will determine whether the
probe identifies only naturally occurring sequences encoding HYDRL, allelic variants, or related 
sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the HYDRL encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:17-32 or from
genomic sequences including promoters, enhancers, and introns of the HYDRL gene. 
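
The 50% sequence identity criterion for probes can be illustrated with a short Python sketch. It assumes the two sequences have already been aligned and are of equal length, with no gap handling; the function names and the example sequences are hypothetical, and the text does not specify a particular method for computing identity.

```python
def percent_identity(seq_a, seq_b):
    """Percent of identical positions between two pre-aligned,
    equal-length nucleotide sequences (no gap handling)."""
    if not seq_a or len(seq_a) != len(seq_b):
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_identity_threshold(probe, target, threshold=50.0):
    """True if the probe has at least `threshold` percent identity to the target."""
    return percent_identity(probe, target) >= threshold


# Hypothetical aligned sequences, for illustration only.
probe = "ACGTACGT"
target = "ACGTTTTT"
print(percent_identity(probe, target), meets_identity_threshold(probe, target))
```

In practice identity is usually computed over an alignment produced by a dedicated tool, so this position-by-position comparison should be read as a simplification.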

Means for producing specific hybridization probes for DNAs encoding HYDRL include the 
cloning of polynucleotide sequences encoding HYDRL or HYDRL derivatives into vectors for the 
production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 
such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding HYDRL may be used for the diagnosis of disorders 
associated with expression of HYDRL. Examples of such disorders include, but are not limited to, a 
cell proliferative disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, 
hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 
hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia, and a cancer including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, a cancer of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, 
penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 
autoimmune/inflammatory disorder, such as inflammation, actinic keratosis, acquired 
immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 
allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis, cholecystitis, cirrhosis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel 
syndrome, episodic lymphopenia with lymphocytotoxins, mixed connective tissue disease (MCTD), 
multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, myelofibrosis,
osteoarthritis, osteoporosis, pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's 
syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic 
lupus erythematosus, systemic sclerosis, primary thrombocythemia, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation; viral, bacterial, fungal, parasitic, protozoal, and helminthic infections;
trauma, and hematopoietic cancer including lymphoma, leukemia, and myeloma; a neurological
disorder, such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 
muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome; fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorder of the central 
nervous system, cerebral palsy, a neuroskeletal disorder, an autonomic nervous system disorder, a 
cranial nerve disorder, a spinal cord disease, muscular dystrophy and other neuromuscular disorder, a 
peripheral nervous system disorder, dermatomyositis and polymyositis; inherited, metabolic, 
endocrine, and toxic myopathy; myasthenia gravis, periodic paralysis; a mental disorder including 
mood, anxiety, and schizophrenic disorders; seasonal affective disorder (SAD); akathisia, amnesia,
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder; a renal disorder such as renal amyloidosis, hypertension; primary 
aldosteronism; Addison's disease; renal failure; glomerulonephritis; chronic glomerulonephritis; 
tubulointerstitial nephritis; cystic disorders of the kidney and dysplastic malformations such as 
polycystic disease, renal dysplasias, and cortical or medullary cysts; inherited polycystic renal 
diseases (PRD), such as recessive and autosomal dominant PRD; medullary cystic disease; medullary 
sponge kidney and tubular dysplasia; Alport's syndrome; non-renal cancers which affect renal 
physiology, such as bronchogenic tumors of the lungs or tumors of the basal region of the brain; 
multiple myeloma; adenocarcinomas of the kidney; and metastatic renal carcinoma; an adrenal 
disorder such as angina, anaphylactic shock, arrhythmias, asthma, cardiovascular shock, Cushing's 
syndrome, hypertension, hypoglycemia, myocardial infarction, migraine, and pheochromocytoma; a 
genetic disorder, such as GM1-gangliosidosis, Niemann-Pick disease, adrenoleukodystrophy, Alport's
syndrome, choroideremia, Duchenne and Becker muscular dystrophy, Down's syndrome, cystic 
fibrosis, chronic granulomatous disease, Gaucher's disease, Huntington's chorea, Marfan's syndrome, 
muscular dystrophy, myotonic dystrophy, pycnodysostosis, Refsum's syndrome, retinoblastoma,
sickle cell anemia, thalassemia, Werner syndrome, von Willebrand's disease, von Hippel-Lindau 
syndrome, Wilms' tumor, Zellweger syndrome, peroxisomal acyl-CoA oxidase deficiency, 
peroxisomal thiolase deficiency, peroxisomal bifunctional protein deficiency, mitochondrial carnitine
palmitoyl transferase and carnitine deficiency, mitochondrial very-long-chain acyl-CoA
dehydrogenase deficiency, mitochondrial medium-chain acyl-CoA dehydrogenase deficiency,
mitochondrial short-chain acyl-CoA dehydrogenase deficiency, mitochondrial electron transport
flavoprotein and electron transport flavoprotein:ubiquinone oxidoreductase deficiency, mitochondrial
trifunctional protein deficiency, and mitochondrial short-chain 3-hydroxyacyl-CoA dehydrogenase
deficiency. The polynucleotide sequences encoding HYDRL may be used in Southern or northern
analysis, dot blot, or other membrane-based technologies; in PCR technologies; in dipstick, pin, and
multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from patients to detect
altered HYDRL expression. Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HYDRL may be useful in assays
that detect the presence of associated disorders, particularly those mentioned above. The nucleotide
sequences encoding HYDRL may be labeled by standard methods and added to a fluid or tissue
sample from a patient under conditions suitable for the formation of hybridization complexes. After a
suitable incubation period, the sample is washed and the signal is quantified and compared with a
standard value. If the amount of signal in the patient sample is significantly altered in comparison to a
control sample, then the presence of altered levels of nucleotide sequences encoding HYDRL in the
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate
the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to
monitor the treatment of an individual patient.

In order to provide a basis for the diagnosis of a disorder associated with expression of
HYDRL, a normal or standard profile for expression is established. This may be accomplished by
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a
sequence, or a fragment thereof, encoding HYDRL, under conditions suitable for hybridization or
amplification. Standard hybridization may be quantified by comparing the values obtained from
normal subjects with values from an experiment in which a known amount of a substantially purified
polynucleotide is used. Standard values obtained in this manner may be compared with values
obtained from samples from patients who are symptomatic for a disorder. Deviation from standard
values is used to establish the presence of a disorder.

Once the presence of a disorder is established and a treatment protocol is initiated,
hybridization assays may be repeated on a regular basis to determine if the level of expression in the
patient begins to approximate that which is observed in the normal subject. The results obtained from
successive assays may be used to show the efficacy of treatment over a period ranging from several
days to months.

With respect to cancer, the presence of an abnormal amount of transcript (either under- or
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the
development of the disease, or may provide a means for detecting the disease prior to the appearance
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals
to employ preventative measures or aggressive treatment earlier, thereby preventing the development
or further progression of the cancer.

Additional diagnostic uses for oligonucleotides designed from the sequences encoding
HYDRL may involve the use of PCR. These oligomers may be chemically synthesized, generated
enzymatically, or produced in vitro. Oligomers will preferably contain a fragment of a polynucleotide
encoding HYDRL, or a fragment of a polynucleotide complementary to the polynucleotide encoding
HYDRL, and will be employed under optimized conditions for identification of a specific gene or
condition. Oligomers may also be employed under less stringent conditions for detection or
quantification of closely related DNA or RNA sequences.

Methods which may also be used to quantify the expression of HYDRL include radiolabeling
or biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C.
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be
accelerated by running the assay in a high-throughput format where the oligomer of interest is
presented in various dilutions and a spectrophotometric or colorimetric response gives rapid
quantitation.
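
As an illustration of interpolating results from a standard curve, the following is a minimal sketch, assuming a piecewise-linear curve through standards of known amount and a signal that increases with amount. The function name, the data layout, and the choice of linear interpolation are assumptions made for this example and are not taken from the cited methods.

```python
from bisect import bisect_left


def interpolate_from_standard_curve(signal, standards):
    """Estimate an amount from a measured signal by linear interpolation.

    `standards` is a list of (known_amount, measured_signal) pairs.
    Assumes the signal increases monotonically with the amount.
    """
    # Order the standards by their measured signal.
    points = sorted(standards, key=lambda p: p[1])
    signals = [s for _, s in points]
    if not (signals[0] <= signal <= signals[-1]):
        raise ValueError("signal lies outside the range of the standard curve")
    i = bisect_left(signals, signal)
    if signals[i] == signal:
        return points[i][0]
    # Interpolate between the two standards that bracket the signal.
    (a0, s0), (a1, s1) = points[i - 1], points[i]
    return a0 + (a1 - a0) * (signal - s0) / (s1 - s0)
```

Signals outside the range of the standards are rejected rather than extrapolated, since the curve is only characterized between the lowest and highest standard.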

In further embodiments, oligonucleotides or longer fragments derived from any of the
polynucleotide sequences described herein may be used as targets in a microarray. The microarray
can be used to monitor the expression level of large numbers of genes simultaneously and to identify
genetic variants, mutations, and polymorphisms. This information may be used to determine gene
function, to understand the genetic basis of a disorder, to diagnose a disorder, and to develop and
monitor the activities of therapeutic agents.

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g.,
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci.
USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application WO95/251116; Shalon, D. et al.
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA
94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.)

In another embodiment of the invention, nucleic acid sequences encoding HYDRL may be 
used to generate hybridization probes useful in mapping the naturally occurring genomic sequence. 
The sequences may be mapped to a particular chromosome, to a specific region of a chromosome, or 
to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), yeast artificial 



chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial P1 constructions, or single
chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 15:345-355; Price,
C.M. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 7:149-154.)

Fluorescent in situ hybridization (FISH) may be correlated with other physical chromosome
mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra,
pp. 965-968.) Examples of genetic map data can be found in various scientific journals or at the
Online Mendelian Inheritance in Man (OMIM) World Wide Web site. Correlation between the
location of the gene encoding HYDRL on a physical chromosomal map and a specific disorder, or a
predisposition to a specific disorder, may help define the region of DNA associated with that disorder.
The nucleotide sequences of the invention may be used to detect differences in gene sequences among
normal, carrier, and affected individuals.

In situ hybridization of chromosomal preparations and physical mapping techniques, such as
linkage analysis using established chromosomal markers, may be used for extending genetic maps.
Often the placement of a gene on the chromosome of another mammalian species, such as mouse,
may reveal associated markers even if the number or arm of a particular human chromosome is not
known. New sequences can be assigned to chromosomal arms by physical mapping. This provides
valuable information to investigators searching for disease genes using positional cloning or other
gene discovery techniques. Once the disease or syndrome has been crudely localized by genetic
linkage to a particular genomic region, e.g., ataxia-telangiectasia to 11q22-23, any sequences mapping
to that area may represent associated or regulatory genes for further investigation. (See, e.g., Gatti,
R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the subject invention may also be
used to detect differences in the chromosomal location due to translocation, inversion, etc., among
normal, carrier, or affected individuals.

In another embodiment of the invention, HYDRL, its catalytic or immunogenic fragments, or
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug
screening techniques. The fragment employed in such screening may be free in solution, affixed to a
solid support, borne on a cell surface, or located intracellularly. The formation of binding complexes
between HYDRL and the agent being tested may be measured.

Another technique for drug screening provides for high throughput screening of compounds
having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT
application WO84/03564.) In this method, large numbers of different small test compounds are
synthesized on a solid substrate. The test compounds are reacted with HYDRL, or fragments thereof,
and washed. Bound HYDRL is then detected by methods well known in the art. Purified HYDRL
can also be coated directly onto plates for use in the aforementioned drug screening techniques.
Alternatively, non-neutralizing antibodies can be used to capture the peptide and immobilize it on a
solid support.

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding HYDRL specifically compete with a test compound for binding 
HYDRL. In this manner, antibodies can be used to detect the presence of any peptide which shares
one or more antigenic determinants with HYDRL. 

In additional embodiments, the nucleotide sequences which encode HYDRL may be used in 
any molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions.

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent. The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in
particular U.S. Ser. No. [Attorney Docket No. PF-0634 P, filed November 12, 1998] and U.S. Ser.
No. 60/135,519, are hereby expressly incorporated by reference.

EXAMPLES 

I. Construction of cDNA Libraries

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a
monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either
isopropanol or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN,
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA
purification kit (Ambion, Austin TX).

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected
(300-1000 bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g.,
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), or pINCY (Incyte
Pharmaceuticals, Palo Alto CA). Recombinant plasmids were transformed into competent E. coli
cells including XL1-Blue, XL1-BlueMRF, or SOLR from Stratagene or DH5α, DH10B, or
ElectroMAX DH10B from Life Technologies.
II. Isolation of cDNA Clones

Plasmids were recovered from host cells by in vivo excision using the UNIZAP vector system
(Stratagene) or by cell lysis. Plasmids were purified using at least one of the following: a Magic or
WIZARD Minipreps DNA purification system (Promega); an AGTC Miniprep purification kit (Edge
Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 Plus Plasmid, QIAWELL 8
Ultra Plasmid purification systems or the R.E.A.L. PREP 96 plasmid purification kit from QIAGEN.
Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and stored, with or
without lyophilization, at 4°C.

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in
384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence
scanner (Labsystems Oy, Helsinki, Finland).
III. Sequencing and Analysis 

cDNA sequencing reactions were processed using standard methods or high-throughput
instrumentation such as the ABI CATALYST 800 (Perkin-Elmer) thermal cycler or the PTC-200
thermal cycler (MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or
the MICROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer).
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides
were carried out using the MEGABACE 1000 DNA sequencing system (Molecular Dynamics); the
ABI PRISM 373 or 377 sequencing system (Perkin-Elmer) in conjunction with standard ABI
protocols and base calling software; or other sequence analysis systems known in the art. Reading
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel,
1997, supra, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques
disclosed in Example VI.

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 
programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 
software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 
(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programming, and dinucleotide nearest neighbor analysis. The sequences were then queried
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 
annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
amino acid sequences, and these full length sequences were subsequently analyzed by querying 
against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 
families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.)

The programs described above for the assembly and analysis of full length polynucleotide and
amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID
NO:17-32. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and
amplification technologies were described in The Invention section above.

IV. Northern Analysis

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 
gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel,
1995, supra, ch. 4 and 16.)

Analogous computer techniques applying BLAST were used to search for identical or related
molecules in nucleotide databases such as GenBank or LIFESEQ (Incyte Pharmaceuticals). This
analysis is much faster than multiple membrane-based hybridizations. In addition, the sensitivity of
the computer search can be modified to determine whether any particular match is categorized as
exact or similar. The basis of the search is the product score, which is defined as:

    (% sequence identity x % maximum BLAST score) / 100

The product score takes into account both the degree of similarity between two sequences and the
length of the sequence match. For example, with a product score of 40, the match will be exact within
a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar molecules are
usually identified by selecting those which show product scores between 15 and 40, although lower
scores may identify related molecules.
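
The product score calculation can be sketched in a few lines. This is an illustrative example only: the function names are hypothetical, and the way the thresholds of 15, 40, and 70 are turned into bands in `describe_match` is one possible reading of the guidance above, not a rule stated in the text.

```python
def product_score(percent_identity, percent_max_blast_score):
    """Product score = (% sequence identity x % maximum BLAST score) / 100."""
    return percent_identity * percent_max_blast_score / 100.0


def describe_match(score):
    """Map a product score to a rough description (assumed banding)."""
    if score >= 70:
        return "exact"
    if score >= 40:
        return "exact within a 1% to 2% error"
    if score >= 15:
        return "similar"
    return "possibly related"
```

For instance, a match with 80% sequence identity and 50% of the maximum BLAST score has a product score of 40.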

The results of northern analyses are reported as a percentage distribution of libraries in which
the transcript encoding HYDRL occurred. Analysis involved the categorization of cDNA libraries by
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic,
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous,
reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma,
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the
sequence of interest was counted and divided by the total number of libraries across all categories.
Percentage values of tissue-specific and disease- or condition-specific expression are reported in
Table 3.
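
The percentage distribution can be illustrated with a short sketch. It assumes that the input lists one category label per library in which the transcript occurred, and that the denominator is the total number of such libraries across all categories. Both the input layout and that reading of the denominator are assumptions made for this example.

```python
from collections import Counter


def percentage_distribution(library_categories):
    """Return the percentage of expressing libraries in each category.

    `library_categories` holds one category label per library in which
    the transcript of interest was found.
    """
    counts = Counter(library_categories)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {category: 100.0 * n / total for category, n in counts.items()}
```

With four expressing libraries, two of them nervous, the nervous category would be reported as 50%.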

V. Chromosomal Mapping of HYDRL Encoding Polynucleotides

The cDNA sequences which were used to assemble SEQ ID NO:18-32 were compared with
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other
implementations of the Smith-Waterman algorithm. Sequences from these databases that matched
SEQ ID NO:18-32 were assembled into clusters of contiguous and overlapping sequences using
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for
Genome Research (WIGR), and Genethon were used to determine if any of the clustered sequences
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location.
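
The assignment rule in the last sentence can be sketched as follows. This is a minimal illustration under stated assumptions: the dictionary layouts, the identifiers, and the function name are hypothetical, and each sequence is assumed to belong to a single cluster.

```python
def assign_map_locations(clusters, mapped):
    """Propagate known map locations to every member of a cluster.

    `clusters` maps a cluster id to a list of sequence ids.
    `mapped` maps a previously mapped sequence id to its map location.
    Returns a dict from sequence id to a sorted list of locations; a
    cluster with several mapped members yields several locations.
    """
    assignments = {}
    for members in clusters.values():
        locations = sorted({mapped[m] for m in members if m in mapped})
        if not locations:
            continue  # no mapped sequence in this cluster
        for member in members:
            assignments[member] = locations
    return assignments
```

A cluster that contains two previously mapped sequences at different positions reports both positions for every member, while sequences in clusters with no mapped member remain unassigned.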

The genetic map locations of SEQ ID NO:18, SEQ ID NO:25, and SEQ ID NO:28 are
described in The Invention as ranges, or intervals, of human chromosomes. More than one map
location is reported for SEQ ID NO:18 and SEQ ID NO:25, indicating that previously mapped
sequences having similarity, but not complete identity, to SEQ ID NO:18 and SEQ ID NO:25 were
assembled into their respective clusters. The map position of an interval, in centiMorgans, is
measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of
measurement based on recombination frequencies between chromosomal markers. On average, 1 cM
is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to
hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by
Genethon which provide boundaries for radiation hybrid markers whose sequences were included in
each of the clusters. Diseases associated with the public and Incyte sequences located within the
indicated intervals are also reported in The Invention where applicable.
VI. Extension of HYDRL Encoding Polynucleotides 

The full length nucleic acid sequences of SEQ ID NO:17-32 were produced by extension of
an appropriate fragment of the full length molecule using oligonucleotide primers designed from this
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other
primer, to initiate 3' extension of the known fragment. The initial primers were designed using
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target
sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would
result in hairpin structures and primer-primer dimerizations was avoided.
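
Two of the primer design criteria, length and GC content, are simple enough to check directly. The sketch below is an illustration only and is not the OLIGO 4.06 procedure: the function names are hypothetical, and it does not evaluate annealing temperature, hairpin formation, or primer-primer dimerization, which need a thermodynamic model.

```python
def gc_content(primer):
    """Return the GC content of a primer as a percentage."""
    seq = primer.upper()
    if not seq:
        raise ValueError("primer sequence is empty")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def passes_basic_criteria(primer, min_len=22, max_len=30, min_gc=50.0):
    """Check only the length (22 to 30 nt) and GC content (50% or more)."""
    return min_len <= len(primer) <= max_len and gc_content(primer) >= min_gc
```

A candidate that passes this check would still need to be screened for secondary structure and annealing temperature before use.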

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR
was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg2+, (NH4)2SO4,
and β-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C,
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. In the
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1: 94°C, 3 min; Step 2:
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times;
Step 6: 68°C, 5 min; Step 7: storage at 4°C.
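
The two cycling programs differ only in the annealing temperature of Step 3, which the following sketch makes explicit. The class and function names are hypothetical. The sketch also assumes that "repeated 20 times" means 20 passes in addition to the first, for 21 passes in total; passing `passes=20` gives the other reading. Ramp times between temperatures are ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hold:
    temp_c: float
    seconds: int


@dataclass(frozen=True)
class Program:
    initial: Hold    # Step 1
    cycle: tuple     # Steps 2-4
    passes: int      # total passes through Steps 2-4 (assumed 21)
    final: Hold      # Step 6
    storage_c: float # Step 7

    def hold_seconds(self):
        """Total programmed hold time, excluding ramps and storage."""
        per_pass = sum(h.seconds for h in self.cycle)
        return self.initial.seconds + self.passes * per_pass + self.final.seconds


def extension_program(anneal_c, passes=21):
    """Build the extension program for a given annealing temperature."""
    return Program(
        initial=Hold(94.0, 180),
        cycle=(Hold(94.0, 15), Hold(anneal_c, 60), Hold(68.0, 120)),
        passes=passes,
        final=Hold(68.0, 300),
        storage_c=4.0,
    )


PCI_PROGRAM = extension_program(anneal_c=60.0)    # primer pair PCI A / PCI B
T7_SK_PROGRAM = extension_program(anneal_c=57.0)  # primer pair T7 / SK+
```

Under the 21-pass reading, each program holds for 4575 seconds, or a little over 76 minutes, before the 4°C storage step.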

The concentration of DNA in each well was determined by dispensing 100 μl PICOGREEN
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in 1X TE
and 0.5 μl of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar,
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the
concentration of DNA. A 5 μl to 10 μl aliquot of the reaction mixture was analyzed by
electrophoresis on a 1% agarose mini-gel to determine which reactions were successful in extending
the sequence.

The extended nucleotides were desalted and concentrated, transferred to 384-well plates,
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and
sonicated or sheared prior to religation into pUC18 vector (Amersham Pharmacia Biotech). For
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%)
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC18 vector (Amersham
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill in restriction site
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on
antibiotic-containing media, and individual colonies were picked and cultured overnight at 37°C in
384-well plates in LB/2x carb liquid media.

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted with 
20% dimethylsulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (Perkin-Elmer). 

In like manner, the nucleotide sequences of SEQ ID NO:17-32 are used to obtain 5' 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, and an 
appropriate genomic library. 

VII. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 17-32 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 µCi of 
[γ-³²P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 
An aliquot containing 10⁷ counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: 
Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize array 
elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra.) An array analogous to a dot 
or slot blot may also be used to arrange and link elements to the surface of a substrate using thermal, 
UV, chemical, or mechanical bonding procedures. A typical array may be produced by hand or using 
available methods and machines and contain any appropriate number of elements. After 
hybridization, nonhybridized probes are removed and a scanner used to determine the levels and 
patterns of fluorescence. The degree of complementarity and the relative abundance of each probe 
which hybridizes to an element on the microarray may be assessed through analysis of the scanned 
images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may comprise 
the elements of the microarray. Fragments suitable for hybridization can be selected using software 
well known in the art such as LASERGENE software (DNASTAR). Full-length cDNAs, ESTs, or 
fragments thereof corresponding to one of the nucleotide sequences of the present invention, or 
selected at random from a cDNA library relevant to the present invention, are arranged on an 
appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., UV cross-linking 
followed by thermal and chemical treatments and subsequent drying. (See, e.g., Schena, M. et al. 
(1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes 
are prepared and used for hybridization to the elements on the substrate. The substrate is analyzed by 
procedures described above. 

IX. Complementary Polynucleotides 

Sequences complementary to the HYDRL-encoding sequences, or any parts thereof, are used 
to detect, decrease, or inhibit expression of naturally occurring HYDRL. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides are 
designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of HYDRL. 
To inhibit transcription, a complementary oligonucleotide is designed from the most unique 5' 
sequence and used to prevent promoter binding to the coding sequence. To inhibit translation, a 
complementary oligonucleotide is designed to prevent ribosomal binding to the HYDRL-encoding 
transcript. 
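
The derivation of a complementary oligonucleotide from a chosen region of a coding sequence can be sketched as follows. This Python example is a minimal illustration, not the OLIGO 4.06 procedure referred to above; the function names are hypothetical, the input sequence is a placeholder rather than any sequence of the invention, and no melting-temperature or uniqueness screening is attempted.

```python
# Translation table for the four DNA bases (upper and lower case).
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def antisense_oligo(coding: str, start: int, length: int) -> str:
    """Return an oligonucleotide complementary to coding[start:start+length].

    `start` is a 0-based offset into the coding strand; the result is
    written 5' to 3', as oligonucleotides are conventionally ordered.
    """
    if length <= 0 or start < 0 or start + length > len(coding):
        raise ValueError("target region lies outside the coding sequence")
    return reverse_complement(coding[start:start + length])


# Placeholder coding sequence (hypothetical, for illustration only).
example_coding = "ATGGCCAAGT"
example_oligo = antisense_oligo(example_coding, 0, 6)
print(example_oligo)
```

In practice the target window would be about 15 to 30 bases, consistent with the oligonucleotide lengths described above.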

X. Expression of HYDRL 

Expression and purification of HYDRL is achieved using bacterial or virus-based expression 
systems. For expression of HYDRL in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 
promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express HYDRL upon induction with isopropyl 
beta-D-thiogalactopyranoside (IPTG). Expression of HYDRL in eukaryotic cells is achieved by infecting 
insect or mammalian cell lines with recombinant Autographa californica nuclear polyhedrosis virus 
(AcMNPV), commonly known as baculovirus. The nonessential polyhedrin gene of baculovirus is 
replaced with cDNA encoding HYDRL by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 
Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, HYDRL is synthesized as a fusion protein with, e.g., glutathione 
S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 
26-kilodalton enzyme from Schistosoma japonicum, enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
HYDRL at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity 
purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra, ch. 10 and 16). Purified HYDRL obtained by these methods can be used directly in the 
following activity assay. 

XI. Demonstration of HYDRL Activity 

For purposes of example, assays measuring the β-glucosidase activity and the lysozyme 
activity of an HYDRL molecule are described. In a β-glucosidase activity assay, varying amounts of 
HYDRL are incubated with 1 mM 4-nitrophenyl β-D-glycopyranoside (a substrate) in 50 mM sodium 
acetate buffer, pH 5.0, for various times (typically 1-5 minutes) at 37°C. The reaction is halted by 
heating to 100°C for 2 minutes. The absorbance is measured spectrophotometrically at 410 nm, and 
is proportional to the β-glucosidase activity of HYDRL in the sample. (See, e.g., Hrmova, M. et al. 
(1998) J. Biol. Chem. 273:11134-11143.) 
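
The proportionality between absorbance and activity can be made explicit with the Beer-Lambert relation, A = ε·c·l. The Python sketch below is illustrative only: the extinction coefficient, path length, reaction volume, and absorbance change are placeholder values that are not given in the text and would have to be determined for the actual buffer and instrument, and the function names are hypothetical.

```python
def product_umol(delta_a, epsilon_per_m_cm, path_cm, volume_ml):
    """Micromoles of chromophore released, from the Beer-Lambert law.

    concentration (mol/L) = delta_a / (epsilon * path length);
    mol/L multiplied by mL gives mmol, and by a further 1000 gives umol.
    """
    molar = delta_a / (epsilon_per_m_cm * path_cm)
    return molar * volume_ml * 1000.0


def activity_umol_per_min(delta_a, minutes, epsilon_per_m_cm,
                          path_cm=1.0, volume_ml=1.0):
    """Average rate of product formation over the incubation time."""
    return product_umol(delta_a, epsilon_per_m_cm, path_cm, volume_ml) / minutes


# Placeholder inputs (assumptions, not values from the text):
# absorbance change of 0.36 at 410 nm after a 2 minute incubation, with
# an assumed extinction coefficient of 18000 per M per cm.
rate = activity_umol_per_min(delta_a=0.36, minutes=2.0,
                             epsilon_per_m_cm=18000.0)
print(rate)
```

Because the extinction coefficient of 4-nitrophenol depends strongly on pH, the coefficient should be measured under the conditions at which the absorbance is actually read.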

Lysozyme activity of HYDRL is demonstrated by its ability to lyse Micrococcus 
lysodeikticus bacterial cells. (See, e.g., Enzymatic Assay of Lysozyme, Sigma Aldrich, St. Louis 
MO). A 0.015% suspension of lyophilized Micrococcus lysodeikticus cells (ATCC 4698) is prepared 
in 66 mM potassium phosphate buffer, pH 6.24 (Buffer A) at 25°C. A 2.5 ml aliquot of the cell 
suspension is pipetted into an optical cuvette and equilibrated to 25°C. The absorbance at 450 nm is 
monitored until constant, between 0.6 and 0.7, using a thermostatted spectrophotometer. A blank 
reaction is prepared in a second cuvette containing 2.5 ml Buffer A. HYDRL is dissolved in cold 
Buffer A. A 0.1 ml aliquot of the HYDRL solution is added to the test cuvette, and 0.1 ml Buffer A is 
added to the blank cuvette. The cuvettes are immediately mixed by inversion, and the decrease in 
absorbance at 450 nm is recorded for approximately 5 minutes. As the bacteria lyse, the turbidity of 
the solution, and hence the absorbance at 450 nm, decreases. The rate of the decrease in absorbance at 
450 nm in the test cuvette is proportional to the lysozyme activity of HYDRL in the original sample. 
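
The rate of decrease in absorbance can be estimated from the recorded time course as sketched below. This Python example is illustrative only: the readings are invented, the function names are hypothetical, and the conversion factor of 0.001 absorbance units per minute per unit of activity is an assumed convention that is not stated in the text.

```python
def slope_per_min(times_min, absorbances):
    """Least-squares slope of absorbance against time (per minute)."""
    n = len(times_min)
    mean_t = sum(times_min) / n
    mean_a = sum(absorbances) / n
    stt = sum((t - mean_t) ** 2 for t in times_min)
    sta = sum((t - mean_t) * (a - mean_a)
              for t, a in zip(times_min, absorbances))
    return sta / stt


def lysozyme_units(times_min, test_a450, blank_a450, delta_a_per_unit=0.001):
    """Blank-corrected rate of A450 decrease, expressed in activity units.

    `delta_a_per_unit` is the assumed absorbance decrease per minute
    that corresponds to one unit of activity.
    """
    net_decrease = (slope_per_min(times_min, blank_a450)
                    - slope_per_min(times_min, test_a450))
    return net_decrease / delta_a_per_unit


# Invented readings over 5 minutes: the test cuvette falls by 0.03 per
# minute from 0.65; the buffer-only blank stays flat.
times = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
test_readings = [0.65, 0.62, 0.59, 0.56, 0.53, 0.50]
blank_readings = [0.0, 0.0, 0.0, 0.0, 0.0, 0.0]

units = lysozyme_units(times, test_readings, blank_readings)
print(round(units, 3))
```

Only the initial, approximately linear part of the time course should be fitted, since the rate falls as the substrate cells are depleted.
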

XII. Functional Assays 

HYDRL function is assessed by expressing the sequences encoding HYDRL at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 
mammalian expression vector containing a strong promoter that drives high levels of cDNA 
expression. Vectors of choice include pCMV SPORT (Life Technologies) and pCR3.1 (Invitrogen, 
Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 µg of recombinant vector 
are transiently transfected into a human cell line, for example, an endothelial or hematopoietic cell 
line, using either liposome formulations or electroporation. 1-2 µg of an additional plasmid 
containing sequences encoding a marker protein are co-transfected. Expression of a marker protein 
provides a means to distinguish transfected cells from nontransfected cells and is a reliable predictor 
of cDNA expression from the recombinant vector. Marker proteins of choice include, e.g., Green 
Fluorescent Protein (GFP; Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), 
an automated, laser optics-based technique, is used to identify transfected cells expressing GFP or 
CD64-GFP and to evaluate the apoptotic state of the cells and other cellular properties. FCM detects 
and quantifies the uptake of fluorescent molecules that diagnose events preceding or coincident with 
cell death. These events include changes in nuclear DNA content as measured by staining of DNA 
with propidium iodide; changes in cell size and granularity as measured by forward light scatter and 
90 degree side light scatter; down-regulation of DNA synthesis as measured by decrease in 
bromodeoxyuridine uptake; alterations in expression of cell surface and intracellular proteins as 
measured by reactivity with specific antibodies; and alterations in plasma membrane composition as 
measured by the binding of fluorescein-conjugated Annexin V protein to the cell surface. Methods in 
flow cytometry are discussed in Ormerod, M.G. (1994) Flow Cytometry, Oxford, New York NY. 

The influence of HYDRL on gene expression can be assessed using highly purified 
populations of cells transfected with sequences encoding HYDRL and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved regions 
of human immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected 
cells using magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake 
Success NY). mRNA can be purified from the cells using methods well known by those of skill in the 
art. Expression of mRNA encoding HYDRL and other genes of interest can be analyzed by northern 
analysis or microarray techniques. 

XIII. Production of HYDRL Specific Antibodies 

HYDRL substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the HYDRL amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, are well 
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 
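
One simple way to locate hydrophilic regions is a sliding-window hydropathy scan, sketched below in Python. This is offered only as an illustration and is not the LASERGENE analysis referred to above: it substitutes the Kyte-Doolittle hydropathy scale, on which more negative values are more hydrophilic, and the function name and test sequence are hypothetical. Sequences are given in one-letter code.

```python
# Kyte-Doolittle hydropathy values (more negative = more hydrophilic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def most_hydrophilic_window(seq, window=15):
    """Return (start, mean_hydropathy) of the most hydrophilic window.

    `start` is a 0-based offset; ties resolve to the earliest window.
    The default window of 15 matches the oligopeptide length in the text.
    """
    if len(seq) < window:
        raise ValueError("sequence is shorter than the window")
    scored = []
    for start in range(len(seq) - window + 1):
        residues = seq[start:start + window]
        mean = sum(KYTE_DOOLITTLE[r] for r in residues) / window
        scored.append((mean, start))
    best_mean, best_start = min(scored)
    return best_start, best_mean


# Hypothetical test sequence: a charged stretch flanked by leucines.
start, score = most_hydrophilic_window("LLLLLKKKKKDDDDDLLLLL", window=5)
print(start, round(score, 2))
```

Hydrophilicity alone is a rough proxy for immunogenicity; surface exposure and flexibility are usually considered as well.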

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A 
peptide synthesizer (Perkin-Elmer) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the 
oligopeptide-KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and 
anti-HYDRL activity by, for example, binding the peptide or HYDRL to a substrate, blocking with 1% 
BSA, reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

XIV. Purification of Naturally Occurring HYDRL Using Specific Antibodies 

Naturally occurring or recombinant HYDRL is substantially purified by immunoaffinity 
chromatography using antibodies specific for HYDRL. An immunoaffinity column is constructed by 
covalently coupling anti-HYDRL antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing HYDRL are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of HYDRL (e.g., high ionic strength 
buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/HYDRL binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such 
as urea or thiocyanate ion), and HYDRL is collected. 

XV. Identification of Molecules Which Interact with HYDRL 

HYDRL, or biologically active fragments thereof, are labeled with ¹²⁵I Bolton-Hunter 
reagent. (See, e.g., Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate 
molecules previously arrayed in the wells of a multi-well plate are incubated with the labeled 
HYDRL, washed, and any wells with labeled HYDRL complex are assayed. Data obtained using 
different concentrations of HYDRL are used to calculate values for the number, affinity, and 
association of HYDRL with the candidate molecules. 
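
One conventional way to obtain such values from bound and free ligand measurements is a Scatchard analysis, sketched below in Python. The text does not name a particular method, so the choice of a single-site model, the function name, and the data (generated from assumed values of the dissociation constant and the number of sites) are all assumptions made for illustration.

```python
def scatchard_fit(bound, free):
    """Fit bound/free against bound for a single class of sites.

    For one-site binding, bound/free = (bmax - bound) / kd, so a
    straight-line fit has slope -1/kd and x-intercept bmax.
    Returns (kd, bmax) in the units of `free` and `bound`.
    """
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_b = sum(bound) / n
    mean_r = sum(ratios) / n
    sxx = sum((b - mean_b) ** 2 for b in bound)
    sxy = sum((b - mean_b) * (r - mean_r) for b, r in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_r - slope * mean_b
    kd = -1.0 / slope
    bmax = intercept * kd  # equals -intercept / slope
    return kd, bmax


# Synthetic data from an assumed one-site model (kd = 2, bmax = 10,
# arbitrary concentration units); real data would come from the assay.
TRUE_KD, TRUE_BMAX = 2.0, 10.0
free_conc = [0.5, 1.0, 2.0, 4.0, 8.0]
bound_conc = [TRUE_BMAX * f / (TRUE_KD + f) for f in free_conc]

kd_est, bmax_est = scatchard_fit(bound_conc, free_conc)
print(round(kd_est, 3), round(bmax_est, 3))
```

With noisy experimental data, direct nonlinear fitting of the binding isotherm is generally preferred, because the Scatchard transformation distorts the error structure.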

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are 
obvious to those skilled in molecular biology or related fields are intended to be within the scope of 
the following claims. 

What is claimed is: 

1. A substantially purified polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, 
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, and fragments thereof. 

2. A substantially purified variant having at least 90% amino acid sequence identity to 
the amino acid sequence of claim 1. 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1. 

4. An isolated and purified polynucleotide variant having at least 90% polynucleotide 
sequence identity to the polynucleotide of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent conditions 
to the polynucleotide of claim 3. 

6. An isolated and purified polynucleotide having a sequence which is complementary 
to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps of: 
(a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in a 
sample, thereby forming a hybridization complex; and 
(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of the polynucleotide in the sample. 

8. The method of claim 7 further comprising amplifying the polynucleotide prior to 
hybridization. 

9. An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:17-32 and fragments thereof. 

10. An isolated and purified polynucleotide variant having at least 90% polynucleotide 
sequence identity to the polynucleotide of claim 9. 

11. An isolated and purified polynucleotide having a sequence which is complementary 
to the polynucleotide of claim 9. 

12. An expression vector comprising at least a fragment of the polynucleotide of claim 3. 

13. A host cell comprising the expression vector of claim 12. 

14. A method for producing a polypeptide, the method comprising the steps of: 
a) culturing the host cell of claim 13 under conditions suitable for the expression 
of the polypeptide; and 
b) recovering the polypeptide from the host cell culture. 

15. A pharmaceutical composition comprising the polypeptide of claim 1 in conjunction 
with a suitable pharmaceutical carrier. 

16. A purified antibody which specifically binds to the polypeptide of claim 1. 

17. A purified agonist of the polypeptide of claim 1. 

18. A purified antagonist of the polypeptide of claim 1. 

19. A method for treating or preventing a disorder associated with decreased expression 
or activity of HYDRL, the method comprising administering to a subject in need of such treatment an 
effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased expression 
or activity of HYDRL, the method comprising administering to a subject in need of such treatment an 
effective amount of the antagonist of claim 18. 

DECLARATION AND POWER OF ATTORNEY FOR 
UNITED STATES PATENT APPLICATION 



As a below named inventor, I hereby declare that: 
My residence, post office address and citizenship are as stated below next to my name, 
and 

I believe that I am the original, first and sole inventor (if only one name is listed below) 
or an original, first and joint inventor (if more than one name is listed below) of the subject 
matter which is claimed and for which a United States patent is sought on the invention entitled 

HUMAN HYDROLASE PROTEINS 

the specification of which: 
/ / is attached hereto. 

/ / was filed on ______ as application Serial No. ______ and if this box 
contains an X / /, was amended on ______. 

/X/ was filed as Patent Cooperation Treaty international application No. PCT/US99/27009 
on November 12, 1999, if this box contains an X / /, was amended on ______ under Patent Cooperation 
Treaty Article 19 on ______ 2001, and if this box contains an X / /, was amended on ______. 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge my duty to disclose information which is material to the examination of 
this application in accordance with Title 37, Code of Federal Regulations, § 1.56(a). 

I hereby claim the benefit under Title 35, United States Code, § 119 or § 365(a)-(b) of any 
foreign application(s) for patent or inventor's certificate indicated below and of any Patent 
Cooperation Treaty international application(s) designating at least one country other than the 
United States indicated below and have also identified below any foreign application(s) for 
patent or inventor's certificate and Patent Cooperation Treaty international application(s) 
designating at least one country other than the United States for the same subject matter and 
having a filing date before that of the application for said subject matter the priority of which is 
claimed: 

Country    Number    Filing Date    Priority Claimed 

/ / Yes   / / No 

/ / Yes   / / No 

I hereby claim the benefit under Title 35, United States Code, § 119(e) of any United 
States provisional application(s) listed below. 

Application Serial No.    Filed                Status (Pending, Abandoned, Patented) 

60/172,256                November 12, 1998    Expired 

60/135,519                May 21, 1999         Expired 

I hereby claim the benefit under Title 35, United States Code, §120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in said prior application(s) in the manner required by the first 
paragraph of Title 35, United States Code § 112, I acknowledge my duty to disclose material 
information as defined in Title 37 Code of Federal Regulations, § 1.56(a) which occurred between 
the filing date(s) of the prior application(s) and the national or Patent Cooperation Treaty 
international filing date of this application: 

Application Serial No.    Filed                Status (Pending, Abandoned, Patented) 



I hereby appoint the following: 

Lucy J. Billings 
Michael C. Cerrone 
Diana Hamlet-Cox 
Richard C. Ekstrom 
Barrie D. Greene 
Matthew R. Kaser 
Lynn E. Murry 
Shirley A. Recipon 
Susan K. Sather 
Michelle M. Stempien 
David G. Streeter 
Stephen Todd 
Christopher Turner 
P. Ben Wang 



respectively and individually, as my patent attorneys and/or agents, with full power of 
substitution and revocation, to prosecute this application and to transact all business in the Patent 
and Trademark Office connected therewith. Please address all communications to: 



Reg. No. 36,749 
Reg. No. 39,132 
Reg. No. 33,302 
Reg. No ."37^02T 
Reg. No. 46,740 
Reg. No. 44,817 
Reg. No. 42,918 
Reg. No. 47,016 
Reg. No. 44,316 
Reg. N o74132T 
Reg. No. 43,168 
Reg. No. 47,139 
Reg. No. 45,167 
Reg. No. 41,420 

LEGAL DEPARTMENT 
INCYTE GENOMICS, INC. 
3160 PORTER DRIVE, PALO ALTO, CA 94304 

TEL: 650-855-0555 FAX: 650-849-8886 or 650-845-4166 



I hereby declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 



Sole Inventor or 
First Joint Inventor: 

Full name: Y. Tom Tang 
Signature: 
Date: ______, 2001 
Citizenship: People's Republic of China 
Residence: San Jose, California 
P.O. Address: 4230 Ranwick Court 
San Jose, California 95118 


Second Joint Inventor: 

Full name: 
Signature: 
Date: 
Citizenship: United States of America 
Residence: Mountain View, California 
P.O. Address: 230 Monroe Drive, #17 
Mountain View, California 94040 

Signature: 
Date: 
Citizenship: 
Residence: Sunnyvale, California 
P.O. Address: 826 Lois Avenue 

Fourth Joint Inventor: 

Full name: Preeti Lal 
Signature: 
Date: ______ 2001 
Citizenship: India 
Residence: ira, California 
P.O. Address: 

Date: 
Citizenship: 
Residence: 
P.O. Address: 

Olga Bandman 

Date: 
Citizenship: 

SEQUENCE LISTING 



<110> INCYTE PHARMACEUTICALS, INC. 
TANG, Y. Tom 
HILLMAN, Jennifer L. 
YUE, Henry 
LAL, Preeti 
BANDMAN, Olga 
CORLEY, Neil C. 
GUEGLER, Karl J. 
BAUGHN, Mariah R. 
LU, Dyung Aina M. 
AZIMZAI, Yalda 
YANG, Junming 

<120> HUMAN HYDROLASE PROTEINS 

<130> PF-0634 PCT 

<140> To Be Assigned 

<141> Herewith 

<150> 09/190,937; unassigned; 60/135,519 

<151> 1998-11-12; 1998-11-12; 1999-05-21 

<160> 35 

<170> PERL Program 

<210> 1 
<211> 159 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2293764CD1 

<400> 1 

Met Lys Ala Trp Gly Thr Val Val Val Thr Leu Ala Thr Leu Met 
1 5 10 15 

Val Val Thr Val Asp Ala Lys Ile Tyr Glu Leu Cys Glu Leu Ala 
20 25 30 

Ala Arg Leu Glu Arg Ala Gly Leu Asn Gly Tyr Lys Gly Tyr Gly 
35 40 45 

Val Gly Asp Trp Leu Cys Met Ala His Tyr Glu Ser Gly Phe Asp 
50 55 60 

Thr Ala Phe Val Asp His Asn Pro Asp Gly Ser Ser Glu Tyr Gly 
65 70 75 

Ile Phe Gln Leu Asn Ser Ala Trp Trp Cys Asp Asn Gly Ile Thr 
80 85 90 

Pro Thr Lys Asn Leu Cys His Met Asp Cys His Asp Leu Leu Asn 
95 100 105 

Arg His Ile Leu Asp Asp Ile Arg Cys Ala Lys Gln Ile Val Ser 
110 115 120 

Ser Gln Asn Gly Leu Ser Ala Trp Thr Ser Trp Arg Leu His Cys 
125 130 135 

Ser Gly His Asp Leu Ser Glu Trp Leu Lys Gly Cys Asp Met His 
140 145 150 

Val Lys Ile Asp Pro Lys Ile His Pro 
155 



<210> 2 
<211> 285 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 949738CD1 

<400> 2 

Met Gly Thr Pro Gly Glu Gly Leu Gly Arg Cys Ser His Ala Leu 
1 5 10 15 

Ile Arg Gly Val Pro Glu Ser Leu Ala Ser Gly Glu Gly Ala Gly 
20 25 30 

Ala Gly Leu Pro Ala Leu Asp Leu Ala Lys Ala Gln Arg Glu His 
35 40 45 

Gly Val Leu Gly Gly Lys Leu Arg Gln Arg Leu Gly Leu Gln Leu 
50 55 60 

Leu Glu Leu Pro Pro Glu Glu Ser Leu Pro Leu Gly Pro Leu Leu 
65 70 75 

Gly Asp Thr Ala Val Ile Gln Gly Asp Thr Ala Leu Ile Thr Arg 
80 85 90 

Pro Trp Ser Pro Ala Arg Arg Pro Glu Val Asp Gly Val Arg Lys 
95 100 105 

Ala Leu Gln Asp Leu Gly Leu Arg Ile Val Glu Ile Gly Asp Glu 
110 115 120 

Asn Ala Thr Leu Asp Gly Thr Asp Val Leu Phe Thr Gly Arg Glu 
125 130 135 

Phe Phe Val Gly Leu Ser Lys Trp Thr Asn His Arg Gly Ala Glu 
140 145 150 

Ile Val Ala Asp Thr Phe Arg Asp Phe Ala Val Ser Thr Val Pro 
155 160 165 

Val Ser Gly Pro Ser His Leu Arg Gly Leu Cys Gly Met Gly Gly 
170 175 180 

Pro Arg Thr Val Val Ala Gly Ser Ser Asp Ala Ala Gln Lys Ala 
185 190 195 

Val Arg Ala Met Ala Val Leu Thr Asp His Pro Tyr Ala Ser Leu 
200 205 210 

Thr Leu Pro Asp Asp Ala Ala Ala Asp Cys Leu Phe Leu Arg Pro 
215 220 225 

Gly Leu Pro Gly Val Pro Pro Phe Leu Leu His Arg Gly Gly Gly 
230 235 240 

Asp Leu Pro Asn Ser Gln Glu Ala Leu Gln Lys Leu Ser Asp Val 
245 250 255 

Thr Leu Val Pro Val Ser Cys Ser Glu Leu Glu Lys Ala Gly Ala 
260 265 270 

Gly Leu Ser Ser Leu Cys Leu Val Leu Ser Thr Arg Pro His Ser 
275 280 285 



<210> 3 
<211> 331 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1297034CD1 

<400> 3 

Met Trp Leu Trp Glu Asp Gln Gly Gly Leu Leu Gly Pro Phe Ser 
1 5 10 15 

Phe Leu Leu Leu Val Leu Leu Leu Val Thr Arg Ser Pro Val Asn 
20 25 30 

Ala Cys Leu Leu Thr Gly Ser Leu Phe Val Leu Leu Arg Val Phe 
35 40 45 

Ser Phe Glu Pro Val Pro Ser Cys Arg Ala Leu Gln Val Leu Lys 
50 55 60 

Pro Arg Asp Arg Ile Ser Ala Ile Ala His Arg Gly Gly Ser His 
65 70 75 

Asp Ala Pro Glu Asn Thr Leu Ala Ala Ile Arg Gln Ala Ala Lys 
80 85 90 

Asn Gly Ala Thr Gly Val Glu Leu Asp Ile Glu Phe Thr Ser Asp 
95 100 105 

Gly Ile Pro Val Leu Met His Asp Asn Thr Val Asp Arg Thr Thr 
110 115 120 

Asp Gly Thr Gly Arg Leu Cys Asp Leu Thr Phe Glu Gln Ile Arg 
125 130 135 

Lys Leu Asn Pro Ala Ala Asn His Arg Leu Arg Asn Asp Phe Pro 
140 145 150 

Asp Glu Lys Ile Pro Thr Leu Arg Glu Ala Val Ala Glu Cys Leu 
155 160 165 

Asn His Asn Leu Thr Ile Phe Phe Asp Val Lys Gly His Ala His 
170 175 180 

Lys Ala Thr Glu Ala Leu Lys Lys Met Tyr Met Glu Phe Pro Gln 
185 190 195 

Leu Tyr Asn Asn Ser Val Val Cys Ser Phe Leu Pro Glu Val Ile 
200 205 210 

Tyr Lys Met Arg Gln Thr Asp Arg Asp Val Ile Thr Ala Leu Thr 
215 220 225 

His Arg Pro Trp Ser Leu Ser His Thr Gly Asp Gly Lys Pro Arg 
230 235 240 

Tyr Asp Thr Phe Trp Lys His Phe Ile Phe Val Met Met Asp Ile 
245 250 255 

Leu Leu Asp Trp Ser Met His Asn Ile Leu Trp Tyr Leu Cys Gly 
260 265 270 

Ile Ser Ala Phe Leu Met Gln Lys Asp Phe Val Ser Pro Ala Tyr 
275 280 285 

Leu Lys Lys Trp Ser Ala Lys Gly Ile Gln Val Val Gly Trp Thr 
290 295 300 

Val Asn Thr Phe Asp Glu Lys Ser Tyr Tyr Glu Ser His Leu Gly 
305 310 315 

Ser Ser Tyr Ile Thr Asp Ser Met Val Glu Asp Cys Glu Pro His 
320 325 330 

Phe 



<210> 4 
<211> 153 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1553276CD1 

<400> 4 

Met Ala Ala Ala Leu Ala Leu Val Ala Gly Val Leu Ser Gly Ala 
1 5 10 15 

Val Leu Pro Leu Trp Ser Ala Leu Pro Gln Tyr Lys Lys Lys Ile 
20 25 30 

Thr Asp Arg Cys Phe His His Ser Glu Cys Tyr Ser Gly Cys Cys 
35 40 45 

Leu Met Asp Leu Asp Ser Gly Gly Ala Phe Cys Ala Pro Arg Ala 
50 55 60 

Arg Ile Thr Met Ile Cys Leu Pro Gln Trp Leu Glu Leu Phe Lys 
65 70 75 

Gly Arg Asp Cys Ile Ile Phe Ile Tyr Glu Ala Pro Thr Pro Ser 
80 85 90 

Leu Val Ser Ala His Asn Gln Gly Ser Tyr Gln His His Leu Pro 
95 100 105 

Leu Pro Asp Gly Leu Asp Val His Ile Gln Gly Leu Asp Val Phe 
110 115 120 

Pro Pro Val Pro Tyr Asp Leu Glu Glu Asp Ala Gly Trp Ser Leu 
125 130 135 

Leu Pro Trp Gly His Arg Pro Trp Leu Pro Pro Thr Cys Ser Lys 
140 145 150 

Ser Ser Ser 



<210> 5 
<211> 571 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 1702211CD1



<400> 5 

Met Glu Arg Ala Val Arg Val Glu Ser Gly Val Leu Val Gly Val
1 5 10 15

Val Cys Leu Leu Leu Ala Cys Pro Ala Thr Ala Thr Gly Pro Glu
20 25 30

Val Ala Gln Pro Glu Val Asp Thr Thr Leu Gly Arg Val Arg Gly
35 40 45

Arg Gln Val Gly Val Lys Gly Thr Asp Arg Leu Val Asn Val Phe
50 55 60

Leu Gly Ile Pro Phe Ala Gln Pro Pro Leu Gly Pro Asp Arg Phe
65 70 75

Ser Ala Pro His Pro Ala Gln Pro Trp Glu Gly Val Arg Asp Ala
80 85 90

Ser Thr Ala Pro Pro Met Cys Leu Gln Asp Val Glu Ser Met Asn
95 100 105

Ser Ser Arg Phe Val Leu Asn Gly Lys Gln Gln Ile Phe Ser Val
110 115 120

Ser Glu Asp Cys Leu Val Leu Asn Val Tyr Ser Pro Ala Glu Val
125 130 135

Pro Ala Gly Ser Gly Arg Pro Val Met Val Trp Val His Gly Gly
140 145 150

Ala Leu Ile Thr Gly Ala Ala Thr Ser Tyr Asp Gly Ser Ala Leu
155 160 165

Ala Ala Tyr Gly Asp Val Val Val Val Thr Val Gln Tyr Arg Leu
170 175 180

Gly Val Leu Gly Phe Phe Ser Thr Gly Asp Glu His Ala Pro Gly
185 190 195

Asn Gln Gly Phe Leu Asp Val Val Ala Ala Leu Arg Trp Val Gln
200 205 210

Glu Asn Ile Ala Pro Phe Gly Gly Asp Leu Asn Cys Val Thr Val
215 220 225

Phe Gly Gly Ser Ala Gly Gly Ser Ile Ile Ser Gly Leu Val Leu
230 235 240

Ser Pro Val Ala Ala Gly Leu Phe His Arg Ala Ile Thr Gln Ser
245 250 255

Gly Val Ile Thr Thr Pro Gly Ile Ile Asp Ser His Pro Trp Pro
260 265 270

Leu Ala Gln Lys Ile Ala Asn Thr Leu Ala Cys Ser Ser Ser Ser
275 280 285

Pro Ala Glu Met Val Gln Cys Leu Gln Gln Lys Glu Gly Glu Glu
290 295 300

Leu Val Leu Ser Lys Lys Leu Lys Asn Thr Ile Tyr Pro Leu Thr
305 310 315

Val Asp Gly Thr Val Phe Pro Lys Ser Pro Lys Glu Leu Leu Lys
320 325 330

Glu Lys Pro Phe His Ser Val Pro Phe Leu Met Gly Val Asn Asn
335 340 345

His Glu Phe Ser Trp Leu Ile Pro Arg Gly Trp Gly Leu Leu Asp
350 355 360

Thr Met Glu Gln Met Ser Arg Glu Asp Met Leu Ala Ile Ser Thr
365 370 375

Pro Val Leu Thr Ser Leu Asp Val Pro Pro Glu Met Met Pro Thr
380 385 390

Val Ile Asp Glu Tyr Leu Gly Ser Asn Ser Asp Ala Gln Ala Lys
395 400 405

Cys Gln Ala Phe Gln Glu Phe Met Gly Asp Val Phe Ile Asn Val
410 415 420

Pro Thr Val Ser Phe Ser Arg Tyr Leu Arg Asp Ser Gly Ser Pro
425 430 435

Val Phe Phe Tyr Glu Phe Gln His Arg Pro Ser Ser Phe Ala Lys
440 445 450

Ile Lys Pro Ala Trp Val Lys Ala Asp His Gly Ala Glu Gly Ala
455 460 465

Phe Val Phe Gly Gly Pro Phe Leu Met Asp Glu Ser Ser Arg Leu
470 475 480

Ala Phe Pro Glu Ala Thr Glu Glu Glu Lys Gln Leu Ser Leu Thr
485 490 495

Met Met Ala Gln Trp Thr His Phe Ala Arg Thr Gly Asp Pro Asn
500 505 510

Ser Lys Ala Leu Pro Pro Trp Pro Gln Phe Asn Gln Ala Glu Gln
515 520 525

Tyr Leu Glu Ile Asn Pro Val Pro Arg Ala Gly Gln Lys Phe Arg
530 535 540

Glu Ala Trp Met Gln Phe Trp Ser Glu Thr Leu Pro Ser Lys Ile
545 550 555

Gln Gln Trp His Gln Lys Gln Lys Asn Arg Lys Ala Gln Glu Asp
560 565 570

Leu



<210> 6 
<211> 347 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1859618CD1 

<400> 6 



Met Ser Ser Trp Ser Arg Gln Arg Pro Lys Ser Pro Gly Gly Ile
1 5 10 15

Gln Pro His Val Ser Arg Thr Leu Phe Leu Leu Leu Leu Leu Ala
20 25 30

Ala Ser Ala Trp Gly Val Thr Leu Ser Pro Lys Asp Cys Gln Val
35 40 45

Phe Arg Ser Asp His Gly Ser Ser Ile Ser Cys Gln Pro Pro Ala
50 55 60

Glu Ile Pro Gly Tyr Leu Pro Ala Asp Thr Val His Leu Ala Val
65 70 75

Glu Phe Phe Asn Leu Thr His Leu Pro Ala Asn Leu Leu Gln Gly
80 85 90

Ala Ser Lys Leu Gln Glu Leu His Leu Ser Ser Asn Gly Leu Glu
95 100 105

Ser Leu Ser Pro Glu Phe Leu Arg Pro Val Pro Gln Leu Arg Val
110 115 120

Leu Asp Leu Thr Arg Asn Ala Leu Thr Gly Leu Pro Pro Gly Leu
125 130 135

Phe Gln Ala Ser Ala Thr Leu Asp Thr Leu Val Leu Lys Glu Asn
140 145 150

Gln Leu Glu Val Leu Glu Val Ser Trp Leu His Gly Leu Lys Ala
155 160 165

Leu Gly His Leu Asp Leu Ser Gly Asn Arg Leu Arg Lys Leu Pro
170 175 180

Pro Gly Leu Leu Ala Asn Phe Thr Leu Leu Arg Thr Leu Asp Leu
185 190 195

Gly Glu Asn Gln Leu Glu Thr Leu Pro Pro Asp Leu Leu Arg Gly
200 205 210

Pro Leu Gln Leu Glu Arg Leu His Leu Glu Gly Asn Lys Leu Gln
215 220 225

Val Leu Gly Lys Asp Leu Leu Leu Pro Gln Pro Asp Leu Arg Tyr
230 235 240

Leu Phe Leu Asn Gly Asn Lys Leu Ala Arg Val Ala Ala Gly Ala
245 250 255

Phe Gln Gly Leu Arg Gln Leu Asp Met Leu Asp Leu Ser Asn Asn
260 265 270

Ser Leu Ala Ser Val Pro Glu Gly Leu Trp Ala Ser Leu Gly Gln
275 280 285

Pro Asn Trp Asp Met Arg Asp Gly Phe Asp Ile Ser Gly Asn Pro
290 295 300

Trp Ile Cys Asp Gln Asn Leu Ser Asp Leu Tyr Arg Trp Leu Gln
305 310 315

Ala Gln Lys Asp Lys Met Phe Ser Gln Asn Asp Thr Arg Cys Ala
320 325 330

Gly Pro Glu Ala Val Lys Gly Gln Thr Leu Leu Ala Val Ala Lys
335 340 345

Ser Gln





























<210> 7 
<211> 194 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2011071CD1

<400> 7 



Met Gln Asp Ala Pro Leu Ser Cys Leu Ser Pro Thr Arg Trp Ser
1 5 10 15

Ser Val Ser Ser Ala Asp Ser Thr Glu Lys Ser Ala Ser Gly Ala
20 25 30

Gly Thr Arg Asn Leu Pro Phe Gln Phe Cys Leu Arg Gln Ala Leu
35 40 45

Arg Met Lys Ala Ala Gly Ile Leu Thr Leu Ile Gly Cys Leu Val
50 55 60

Thr Gly Ala Glu Ser Lys Ile Tyr Thr Arg Cys Lys Leu Ala Lys
65 70 75

Ile Phe Ser Arg Ala Gly Leu Asp Asn Tyr Trp Gly Phe Ser Leu
80 85 90

Gly Asn Trp Ile Cys Met Ala Tyr Tyr Glu Ser Gly Tyr Asn Thr
95 100 105

Thr Ala Pro Thr Val Leu Asp Asp Gly Ser Ile Asp Tyr Gly Ile
110 115 120

Phe Gln Ile Asn Thr Phe Ala Trp Cys Arg Arg Gly Lys Leu Lys
125 130 135

Glu Asn Asn His Cys His Val Ala Cys Ser Ala Leu Ile Thr Asp
140 145 150

Asp Leu Thr Asp Ala Ile Ile Cys Ala Arg Lys Ile Val Lys Glu
155 160 165

Thr Gln Gly Met Asn Tyr Trp Gln Gly Trp Lys Lys His Cys Glu
170 175 180

Gly Arg Asp Leu Ser Glu Trp Lys Lys Gly Cys Glu Val Ser
185 190



<210> 8 

<211> 361 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2186517CD1 



<400> 8 



Met Ala Trp Gln Gly Trp Pro Ala Ala Trp Gln Trp Val Ala Gly
1 5 10 15

Cys Trp Leu Leu Leu Val Leu Val Leu Val Leu Leu Val Ser Pro
20 25 30

Arg Gly Cys Arg Ala Arg Arg Gly Leu Arg Gly Leu Leu Met Ala
35 40 45

His Ser Gln Arg Leu Leu Phe Arg Ile Gly Tyr Ser Leu Tyr Thr
50 55 60

Arg Thr Trp Leu Gly Tyr Leu Phe Tyr Arg Gln Gln Leu Arg
65 70 75

Ala Arg Asn Arg Tyr Pro Lys Gly His Ser Lys Thr Gln Thr Arg
80 85 90

Leu Phe Asn Gly Val Lys Val Leu Pro Ile Pro Val Leu Ser Asp
95 100 105

Asn Tyr Ser Tyr Leu Ile Ile Asp Thr Gln Ala Gln Leu Ala Val
110 115 120

Ala Val Asp Pro Ser Asp Pro Arg Ala Val Gln Ala Ser Ile Glu
125 130 135

Lys Glu Gly Val Thr Leu Val Ala Ile Leu Cys Thr His Lys His
140 145 150

Trp Asp His Ser Gly Gly Asn Arg Asp Leu Ser Arg Arg His Arg
155 160 165

Asp Cys Arg Val Tyr Gly Ser Pro Gln Asp Gly Ile Pro Tyr Leu
170 175 180

Thr His Pro Leu Cys His Gln Asp Val Val Ser Val Gly Arg Leu
185 190 195

Gln Ile Arg Ala Leu Ala Thr Pro Gly His Thr Gln Gly His Leu
200 205 210

Val Tyr Leu Leu Asp Gly Glu Pro Tyr Lys Gly Pro Ser Cys Leu
215 220 225

Phe Ser Gly Asp Leu Leu Phe Leu Ser Gly Cys Gly Arg Thr Phe
230 235 240

Glu Gly Asn Ala Glu Thr Met Leu Ser Ser Leu Asp Thr Val Leu
245 250 255

Gly Leu Gly Asp Asp Thr Leu Leu Trp Pro Gly His Glu Tyr Ala
260 265 270

Glu Glu Asn Leu Gly Phe Ala Gly Val Val Glu Pro Glu Asn Leu
275 280 285

Ala Arg Glu Arg Lys Met Gln Trp Val Gln Arg Gln Arg Leu Glu
290 295 300

Arg Lys Gly Thr Cys Pro Ser Thr Leu Gly Glu Glu Arg Ser Tyr
305 310 315

Asn Pro Phe Leu Arg Thr His Cys Leu Ala Leu Gln Glu Ala Leu
320 325 330

Gly Pro Gly Pro Gly Pro Thr Gly Asp Asp Asp Tyr Ser Arg Ala
335 340 345

Gln Leu Leu Glu Glu Leu Arg Arg Leu Lys Asp Met His Lys Ser
350 355 360

Lys



<210> 9 
<211> 306 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2253585CD1 

<400> 9 

Met Leu Arg Trp Thr Arg Ala Trp Arg Leu Pro Arg Glu Gly Leu 
1 5 10 15

Gly Pro His Gly Pro Ser Phe Ala Arg Val Pro Val Ala Pro Ser 
20 25 30 

Ser Ser Ser Gly Gly Arg Gly Gly Ala Glu Pro Arg Pro Leu Pro 
35 40 45 

Leu Ser Tyr Arg Leu Leu Asp Gly Glu Ala Ala Leu Pro Ala Val 
50 55 60 

Val Phe Leu His Gly Leu Phe Gly Ser Lys Thr Asn Phe Asn Ser 
65 70 75 

Ile Ala Lys Ile Leu Ala Gln Gln Thr Gly Arg Arg Val Leu Thr
80 85 90

Val Asp Ala Arg Asn His Gly Asp Ser Pro His Ser Pro Asp Met
95 100 105

Ser Tyr Glu Ile Met Ser Gln Asp Leu Gln Asp Leu Leu Pro Gln
110 115 120

Leu Gly Leu Val Pro Cys Val Val Val Gly His Ser Met Gly Gly
125 130 135

Lys Thr Ala Met Leu Leu Ala Leu Gln Arg Pro Glu Leu Val Glu
140 145 150

Arg Leu Ile Ala Val Asp Ile Ser Pro Val Glu Ser Thr Gly Val
155 160 165

Ser His Phe Ala Thr Tyr Val Ala Ala Met Arg Ala Ile Asn Ile
170 175 180

Ala Asp Glu Leu Pro Arg Ser Arg Ala Arg Lys Leu Ala Asp Glu
185 190 195

Gln Leu Ser Ser Val Ile Gln Asp Met Ala Val Arg Gln His Leu
200 205 210

Leu Thr Asn Leu Val Glu Val Asp Gly Arg Phe Val Trp Arg Val
215 220 225

Asn Leu Asp Ala Leu Thr Gln His Leu Asp Lys Ile Leu Ala Phe
230 235 240

Pro Gln Arg Gln Glu Ser Tyr Leu Gly Pro Thr Leu Phe Leu Leu
245 250 255

Gly Gly Asn Ser Gln Phe Val His Pro Ser His His Pro Glu Ile

260 265 270

Met Arg Leu Phe Pro Arg Ala Gln Met Gln Thr Val Pro Asn Ala

275 280 285

Gly His Trp Ile His Ala Asp Arg Pro Gln Asp Phe Ile Ala Ala

290 295 300

Ile Arg Gly Phe Leu Val

305 



<210> 10 

<211> 483

<212> PRT

<213> Homo sapiens

<220>

<221> misc_feature

<223> Incyte ID No: 2447520CD1 

<400> 10 

Met Ser Asn Lys Leu Leu Ser Pro His Pro His Ser Val Val Leu 
1 5 10 15

Arg Ser Glu Phe Lys Met Ala Ser Ser Pro Ala Val Leu Arg Ala 
20 25 30 

Ser Arg Leu Tyr Gln Trp Ser Leu Lys Ser Ser Ala Gln Phe Leu
35 40 45

Gly Ser Pro Gln Leu Arg Gln Val Gly Gln Ile Ile Arg Val Pro
50 55 60

Ala Arg Met Ala Ala Thr Leu Ile Leu Glu Pro Ala Gly Arg Cys
65 70 75

Cys Trp Asp Glu Pro Val Arg Ile Ala Val Arg Gly Leu Ala Pro
80 85 90

Glu Gln Pro Val Thr Leu Arg Ala Ser Leu Arg Asp Glu Lys Gly
95 100 105

Ala Leu Phe Gln Ala His Ala Arg Tyr Arg Ala Asp Thr Leu Gly
110 115 120

Glu Leu Asp Leu Glu Arg Ala Pro Ala Leu Gly Gly Ser Phe Ala 
125 130 135 

Gly Leu Glu Pro Met Gly Leu Leu Trp Ala Leu Glu Pro Glu Lys 
140 145 150 

Pro Leu Val Arg Leu Val Lys Arg Asp Val Arg Thr Pro Leu Ala 
155 160 165 

Val Glu Leu Glu Val Leu Asp Gly His Asp Pro Asp Pro Gly Arg 
170 175 180 

Leu Leu Cys Gln Thr Arg His Glu Arg Tyr Phe Leu Pro Pro Gly
185 190 195 

Val Arg Arg Glu Pro Val Arg Val Gly Arg Val Arg Gly Thr Leu 
200 205 210 

Phe Leu Pro Pro Glu Pro Gly Pro Phe Pro Gly Ile Val Asp Met
215 220 225 

Phe Gly Thr Gly Gly Gly Leu Leu Glu Tyr Arg Ala Ser Leu Leu 
230 235 240 

Ala Gly Lys Gly Phe Ala Val Met Ala Leu Ala Tyr Tyr Asn Tyr 
245 250 255 

Glu Asp Leu Pro Lys Thr Met Glu Thr Leu His Leu Glu Tyr Phe 
260 265 270

Glu Glu Ala Met Asn Tyr Leu Leu Ser His Pro Glu Val Lys Gly 
275 280 285 

Pro Gly Val Gly Leu Leu Gly Ile Ser Lys Gly Gly Glu Leu Cys
290 295 300

Leu Ser Met Ala Ser Phe Leu Lys Gly Ile Thr Ala Ala Val Val
305 310 315

Ile Asn Gly Ser Val Ala Asn Val Gly Gly Thr Leu Arg Tyr Lys
320 325 330

Gly Glu Thr Leu Pro Pro Val Gly Val Asn Arg Asn Arg Ile Lys
335 340 345

Val Thr Lys Asp Gly Tyr Ala Asp Ile Val Asp Val Leu Asn Ser
350 355 360

Pro Leu Glu Gly Pro Asp Gln Lys Ser Phe Ile Pro Val Glu Arg
365 370 375

Ala Glu Ser Thr Phe Leu Phe Leu Val Gly Gln Asp Asp His Asn
380 385 390

Trp Lys Ser Glu Phe Tyr Ala Asn Glu Ala Cys Lys Arg Leu Gln
395 400 405

Ala His Gly Arg Arg Lys Pro Gln Ile Ile Cys Tyr Pro Glu Thr
410 415 420

Gly His Tyr Ile Glu Pro Pro Tyr Phe Pro Leu Cys Arg Ala Ser
425 430 435

Leu His Ala Leu Val Gly Ser Pro Ile Ile Trp Gly Gly Glu Pro
440 445 450

Arg Ala His Ala Met Ala Gln Val Asp Ala Trp Lys Gln Leu Gln
455 460 465

Thr Phe Phe His Lys His Leu Gly Gly His Glu Gly Thr Ile Pro
470 475 480 

Ser Lys Val 



<210> 11 
<211> 144 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2481345CD1 

<400> 11 

Met Leu Leu Leu Trp Val Ser Val Val Ala Ala Leu Ala Leu Ala 
1 5 10 15

Val Leu Ala Pro Gly Ala Gly Glu Gln Arg Arg Arg Ala Ala Lys

20 25 30 

Ala Pro Asn Val Val Leu Val Val Ser Asp Ser Phe Asp Gly Arg 

35 40 45 

Leu Thr Phe His Pro Gly Ser Gln Val Val Lys Leu Pro Phe Ile

50 55 60 

Asn Phe Met Lys Thr Arg Gly Thr Ser Phe Leu Asn Ala Tyr Thr 

65 70 75

Asn Ser Pro Ile Cys Cys Pro Ser Arg Ala Ala Met Trp Ser Gly

80 85 90 

Leu Phe Thr His Leu Thr Glu Ser Trp Asn Asn Phe Lys Gly Leu 
95 100 105

Asp Pro Asn Tyr Thr Thr Trp Met Asp Val Met Glu Arg His Gly
110 115 120

Tyr Arg Thr Gln Lys Phe Gly Lys Leu Asp Tyr Thr Ser Gly His
125 130 135

His Ser Ile Ser Asn Arg Val Glu Ala
140



<210> 12 
<211> 180 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2484020CD1



<400> 12 



Met Met Lys Phe Lys Pro Asn Gln Thr Arg Thr Tyr Asp Arg Glu
1 5 10 15

Gly Phe Lys Lys Arg Ala Ala Cys Leu Cys Phe Arg Ser Glu Gln
20 25 30

Glu Asp Glu Val Leu Leu Val Ser Ser Ser Arg Tyr Pro Asp Gln
35 40 45

Trp Ile Val Pro Gly Gly Gly Met Glu Pro Glu Glu Glu Pro Gly
50 55 60

Gly Ala Ala Val Arg Glu Val Tyr Glu Glu Ala Gly Val Lys Gly
65 70 75

Lys Leu Gly Arg Leu Leu Gly Ile Phe Glu Asn Gln Asp Arg Lys
80 85 90

His Arg Thr Tyr Val Tyr Val Leu Thr Val Thr Glu Ile Leu Glu
95 100 105

Asp Trp Glu Asp Ser Val Asn Ile Gly Arg Lys Arg Glu Trp Phe
110 115 120

Lys Val Glu Asp Ala Ile Lys Val Leu Gln Cys His Lys Pro Val
125 130 135

His Ala Glu Tyr Leu Glu Lys Leu Lys Leu Gly Cys Ser Pro Ala
140 145 150

Asn Gly Asn Ser Thr Val Pro Ser Leu Pro Asp Asn Asn Ala Leu
155 160 165

Phe Val Thr Ala Ala Gln Thr Ser Gly Leu Pro Ser Ser Val Arg
170 175 180



<210> 13 
<211> 375 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2862528CD1

<400> 13

Met Ala Arg Pro Gly Leu Ile His Ser Ala Pro Gly Leu Pro Asp
1 5 10 15

Thr Cys Ala Leu Leu Gln Pro Pro Ala Ala Ser Ala Ala Ala Ala
20 25 30

Pro Ser Met Ser Gly Pro Asp Val Glu Thr Pro Ser Ala Ile Gln
35 40 45

Ile Cys Arg Ile Met Arg Pro Asp Asp Ala Asn Val Ala Gly Asn
50 55 60

Val His Gly Gly Thr Ile Leu Lys Met Ile Glu Glu Ala Gly Ala
65 70 75

Ile Ile Ser Thr Arg His Cys Asn Ser Gln Asn Gly Glu Arg Cys
80 85 90

Val Ala Ala Leu Ala Arg Val Glu Arg Thr Asp Phe Leu Ser Pro
95 100 105

Met Cys Ile Gly Glu Val Ala His Val Ser Ala Glu Ile Thr Tyr
110 115 120

Thr Ser Lys His Ser Val Glu Val Gln Val Asn Val Met Ser Glu
125 130 135

Asn Ile Leu Thr Gly Ala Lys Lys Leu Thr Asn Lys Ala Thr Leu
140 145 150

Trp Tyr Val Pro Leu Ser Leu Lys Asn Val Asp Lys Val Leu Glu
155 160 165

Val Pro Pro Val Val Tyr Ser Arg Gln Glu Gln Glu Glu Glu Gly
170 175 180

Arg Lys Arg Tyr Glu Ala Gln Lys Leu Glu Arg Met Glu Thr Lys
185 190 195

Trp Arg Asn Gly Asp Ile Val Gln Pro Val Leu Asn Pro Gly Val
200 205 210

Thr Met Lys Leu Met Asp Glu Val Ala Gly Ile Val Ala Ala Arg
215 220 225

His Cys Lys Thr Asn Ile Val Thr Ala Ser Val Asp Ala Ile Asn
230 235 240

Phe His Asp Lys Ile Arg Lys Gly Cys Val Ile Thr Ile Ser Gly
245 250 255

Arg Met Thr Phe Thr Ser Asn Lys Ser Met Glu Ile Glu Val Leu
260 265 270

Val Asp Ala Asp Pro Val Val Asp Ser Ser Gln Lys Arg Tyr Arg
275 280 285

Ala Ala Ser Ala Phe Phe Thr Tyr Val Ser Leu Ser Gln Glu Gly
290 295 300

Arg Ser Leu Pro Val Pro Gln Leu Val Pro Glu Thr Glu Asp Glu
305 310 315

Lys Lys Arg Phe Glu Glu Gly Lys Gly Arg Tyr Leu Gln Met Lys
320 325 330

Ala Asn Asp Arg Ala Thr Arg Ser Leu Ser Pro Arg Leu Pro Pro
335 340 345

Pro Ala Thr Gly Ala Ser Ser Ser His Gly Asn Gly Pro Ser Val
350 355 360

Gln Ser Leu Arg Ser Ser Pro Leu Gly Gln Lys Pro Asn Ser His
365 370 375 



<210> 14 
<211> 637 
<212> PRT

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 3200650CD1 

<400> 14 

Met Thr Thr Trp Ser Leu Arg Arg Arg Pro Ala Arg Thr Leu Gly 
1 5 10 15

Leu Leu Leu Leu Val Val Leu Gly Phe Leu Val Leu Arg Arg Leu
20 25 30

Asp Trp Ser Thr Leu Val Pro Leu Arg Leu Arg His Arg Gln Leu
35 40 45

Gly Leu Gln Ala Lys Gly Trp Asn Phe Met Leu Glu Asp Ser Thr
50 55 60

Phe Trp Ile Phe Gly Gly Ser Ile His Tyr Phe Arg Val Pro Arg
65 70 75 

Glu Tyr Trp Arg Asp Arg Leu Leu Lys Met Lys Ala Cys Gly Leu 
80 85 90 

Asn Thr Leu Thr Thr Tyr Val Pro Trp Asn Leu His Glu Pro Glu 
95 100 105 

Arg Gly Lys Phe Asp Phe Leu Trp Glu Thr Trp Thr Leu Lys Ala 

110 115 120 

Phe Val Leu Met Ala Ala Glu Ile Gly Leu Trp Val Ile Leu Arg

125 130 135

Pro Gly Pro Tyr Ile Cys Ser Glu Met Asp Leu Gly Gly Leu Pro

140 145 150

Ser Trp Leu Leu Gln Asp Pro Gly Met Arg Leu Arg Thr Thr Tyr

155 160 165

Lys Gly Phe Thr Glu Ala Val Asp Leu Tyr Phe Asp His Leu Met

170 175 180

Ser Arg Val Val Pro Leu Gln Tyr Lys Arg Gly Gly Pro Ile Ile

185 190 195

Ala Val Gln Val Glu Asn Glu Tyr Gly Ser Tyr Asn Lys Asp Pro

200 205 210

Ala Tyr Met Pro Tyr Val Lys Lys Ala Leu Glu Asp Arg Gly Ile

215 220 225

Val Glu Leu Leu Leu Thr Ser Asp Asn Lys Asp Gly Leu Ser Lys

230 235 240

Gly Ile Val Gln Gly Val Leu Ala Thr Ile Asn Leu Gln Ser Thr

245 250 255

His Glu Leu Gln Leu Leu Thr Thr Phe Leu Phe Asn Val Gln Gly

260 265 270

Thr Gln Pro Lys Met Val Met Glu Tyr Trp Thr Gly Trp Phe Asp

275 280 285

Ser Trp Gly Gly Pro His Asn Ile Leu Asp Ser Ser Glu Val Leu

290 295 300

Lys Thr Val Ser Ala Ile Val Asp Ala Gly Ser Ser Ile Asn Leu

305 310 315 

Tyr Met Phe His Gly Gly Thr Asn Phe Gly Phe Met Asn Gly Ala 

320 325 330

Met His Phe His Asp Tyr Lys Ser Asp Val Thr Ser Tyr Asp Tyr 

335 340 345 

Asp Ala Val Leu Thr Glu Ala Gly Asp Tyr Thr Ala Lys Tyr Met 
350 355 360

Lys Leu Arg Asp Phe Phe Gly Ser Ile Ser Gly Ile Pro Leu Pro

365 370 375

Pro Pro Pro Asp Leu Leu Pro Lys Met Pro Tyr Glu Pro Leu Thr

380 385 390

Pro Val Leu Tyr Leu Ser Leu Trp Asp Ala Leu Lys Tyr Leu Gly

395 400 405

Glu Pro Ile Lys Ser Glu Lys Pro Ile Asn Met Glu Asn Leu Pro

410 415 420

Val Asn Gly Gly Asn Gly Gln Ser Phe Gly Tyr Ile Leu Tyr Glu

425 430 435

Thr Ser Ile Thr Ser Ser Gly Ile Leu Ser Gly His Val His Asp

440 445 450

Arg Gly Gln Val Phe Val Asn Thr Val Ser Ile Gly Phe Leu Asp

455 460 465

Tyr Lys Thr Thr Lys Ile Ala Val Pro Leu Ile Gln Gly Tyr Thr

470 475 480

Val Leu Arg Ile Leu Val Glu Asn Arg Gly Arg Val Asn Tyr Gly

485 490 495

Glu Asn Ile Asp Asp Gln Arg Lys Gly Leu Ile Gly Asn Leu Tyr

500 505 510

Leu Asn Asp Ser Pro Leu Lys Asn Phe Arg Ile Tyr Ser Leu Asp

515 520 525

Met Lys Lys Ser Phe Phe Gln Arg Phe Gly Leu Asp Lys Trp Ser

530 535 540

Ser Leu Pro Glu Thr Pro Thr Leu Pro Ala Phe Phe Leu Gly Ser

545 550 555

Leu Ser Ile Ser Ser Thr Pro Cys Asp Thr Phe Leu Lys Leu Glu

560 565 570

Gly Trp Glu Lys Gly Val Val Phe Ile Asn Gly Gln Asn Leu Gly

575 580 585

Arg Tyr Trp Asn Ile Gly Pro Gln Lys Thr Leu Tyr Leu Pro Gly

590 595 600

Pro Trp Leu Ser Ser Gly Ile Asn Gln Val Ile Val Phe Glu Glu

605 610 615

Thr Met Ala Gly Pro Ala Leu Gln Phe Thr Glu Thr Pro His Leu

620 625 630

Gly Arg Asn Gln Tyr Ile Lys
635 



<210> 15 
<211> 314 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 4107621CD1 

<400> 15 

Met Ser Glu Asn Ala Ala Pro Gly Leu Ile Ser Glu Leu Lys Leu

1 5 10 15

Ala Val Pro Trp Gly His Ile Ala Ala Lys Ala Trp Gly Ser Leu

20 25 30

Gln Gly Pro Pro Val Leu Cys Leu His Gly Trp Leu Asp Asn Ala
35 40 45

Ser Ser Phe Asp Arg Leu Ile Pro Leu Leu Pro Gln Asp Phe Tyr
50 55 60

Tyr Val Ala Met Asp Phe Gly Gly His Gly Leu Ser Ser His Tyr
65 70 75

Ser Pro Gly Val Pro Tyr Tyr Leu Gln Thr Phe Val Ser Glu Ile
80 85 90

Arg Arg Val Val Ala Ala Leu Lys Trp Asn Arg Phe Ser Ile Leu
95 100 105

Gly His Ser Phe Gly Gly Val Val Gly Gly Met Phe Phe Cys Thr
110 115 120

Phe Pro Glu Met Val Asp Lys Leu Ile Leu Leu Asp Thr Pro Leu
125 130 135

Phe Leu Leu Glu Ser Asp Glu Met Glu Asn Leu Leu Thr Tyr Lys
140 145 150

Arg Arg Ala Ile Glu His Val Leu Gln Val Glu Ala Ser Gln Glu
155 160 165

Pro Ser His Val Phe Ser Leu Lys Gln Leu Leu Gln Arg Leu Leu
170 175 180

Lys Ser Asn Ser His Leu Ser Glu Glu Cys Gly Glu Leu Leu Leu
185 190 195

Gln Arg Gly Thr Thr Lys Val Ala Thr Gly Leu Val Leu Asn Arg
200 205 210

Asp Gln Arg Leu Ala Trp Ala Glu Asn Ser Ile Asp Phe Ile Ser
215 220 225

Arg Glu Leu Cys Ala His Ser Ile Arg Lys Leu Gln Ala His Val
230 235 240

Leu Leu Ile Lys Ala Val His Gly Tyr Phe Asp Ser Arg Gln Asn
245 250 255

Tyr Ser Glu Lys Glu Ser Leu Ser Phe Met Ile Asp Thr Met Lys
260 265 270

Ser Thr Leu Lys Glu Gln Phe Gln Phe Val Glu Val Pro Gly Asn
275 280 285

His Cys Val His Met Ser Glu Pro Gln His Val Ala Ser Ile Ile
290 295 300

Ser Ser Phe Leu Gln Cys Thr His Met Leu Pro Ala Gln Leu
305 310 



<210> 16 

<211> 448 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4661133CD1 

<400> 16 

Met Arg Arg Ala Ala Leu Arg Leu Cys Ala Leu Gly Lys Gly Gln
1 5 10 15

Leu Thr Pro Gly Arg Gly Leu Thr Gln Gly Pro Gln Asn Pro Lys
20 25 30

Lys Gln Gly Ile Phe His Ile His Glu Ala Cys Ser Ser Ile His
35 40 45

Val Asn His Val Arg Asp Lys Leu Arg Glu Ile Val Gly Ala Ser
50 55 60

Thr Asn Trp Arg Asp His Val Lys Ala Met Glu Glu Arg Lys Leu
65 70 75

Leu His Ser Phe Leu Ala Lys Ser Gln Asp Gly Leu Pro Pro Arg
80 85 90

Arg Met Lys Asp Ser Tyr Ile Glu Val Leu Leu Pro Leu Gly Ser
95 100 105

Glu Pro Glu Leu Arg Glu Lys Tyr Leu Thr Val Gln Asn Thr Val
110 115 120

Arg Phe Gly Arg Ile Leu Glu Asp Leu Asp Ser Leu Gly Val Leu
125 130 135

Ile Cys Tyr Met His Asn Lys Ile His Ser Ala Lys Met Ser Pro
140 145 150

Leu Ser Ile Val Thr Ala Leu Val Asp Lys Ile Asp Met Cys Lys
155 160 165

Lys Ser Leu Ser Pro Glu Gln Asp Ile Lys Phe Ser Gly His Val
170 175 180

Ser Trp Val Gly Lys Thr Ser Met Glu Val Lys Met Gln Met Phe
185 190 195

Gln Leu His Gly Asp Glu Phe Cys Pro Val Leu Asp Ala Thr Phe
200 205 210

Val Met Val Ala Arg Asp Ser Glu Asn Lys Gly Pro Ala Phe Val
215 220 225

Asn Pro Leu Ile Pro Glu Ser Pro Glu Glu Glu Glu Leu Phe Arg
230 235 240

Gln Gly Glu Leu Asn Lys Gly Arg Arg Ile Ala Phe Ser Ser Thr
245 250 255

Ser Leu Leu Lys Met Ala Pro Ser Ala Glu Glu Arg Thr Thr Ile
260 265 270

His Glu Met Phe Leu Ser Thr Leu Asp Pro Lys Thr Ile Ser Phe
275 280 285

Arg Ser Arg Val Leu Pro Ser Asn Ala Val Trp Met Glu Asn Ser
290 295 300

Lys Leu Lys Ser Leu Glu Ile Cys His Pro Gln Glu Arg Asn Ile
305 310 315

Phe Asn Arg Ile Phe Gly Gly Phe Leu Met Arg Lys Ala Tyr Glu
320 325 330

Leu Ala Trp Ala Thr Ala Cys Ser Phe Gly Gly Ser Arg Pro Phe
335 340 345

Val Val Ala Val Asp Asp Ile Met Phe Gln Lys Pro Val Glu Val
350 355 360

Gly Ser Leu Leu Phe Leu Ser Ser Gln Val Cys Phe Thr Gln Asn
365 370 375

Asn Tyr Ile Gln Val Arg Val His Ser Glu Val Ala Ser Leu Gln
380 385 390

Glu Lys Gln His Thr Thr Thr Asn Val Phe His Phe Thr Phe Met
395 400 405

Ser Glu Lys Glu Val Pro Leu Val Phe Pro Lys Thr Tyr Gly Glu
410 415 420

Ser Met Leu Tyr Leu Asp Gly Gln Arg His Phe Asn Ser Met Ser
425 430 435 

Gly Pro Ala Thr Leu Arg Lys Asp Tyr Leu Val Glu Pro 
440 445 
<210> 17

<211> 723 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2293764CB1 

<400> 17 

gcagcaacag agttgcaggt gtaaaataac gggaaggcgg gatgcgtggc taaattgctc 60 
tgcgtgcaca aagagtagga gagcccagag ttccagaatg cccctaattc cgaacaccac 120 
a <?ggtgagtc tggagcaagt cacctgggag ggcttacagg tgccataatg aaggcctggg 180
gcactgtggt agtgaccttg gccacgctga tggttgtcac tgtggatgcc aagatctatg 240 
aactctgcga gctggcggca agactggaga gagcagggct gaacggctac aagggctacg 300 
gcgttggaga ctggctgtgc atggctcatt atgagagtgg ctttgacacc gccttcgtgg 360 
accacaatcc tgatggcagc agtgaatatg gcattttcca actgaattct gcctggtggt 420 
gtgacaatgg cattacaccc accaagaacc tctgccacat ggattgtcat gacctgctca 480 
atcgccatat tctggatgac atcaggtgtg ccaagcagat tgtgtcctca cagaatgggc 540
tttctgcctg gacttcttgg aggctacact gttctggcca tgatttatct gaatggctca 600 
aggggtgtga tatgcatgtg aaaattgatc caaaaattca tccatgactc agattcgaag 660
agacagattt tatcttcctt tcatttcttc atattgtcac tttaataaag gatggtactc 720 
gtc 723 



<210> 18 

<211> 1228 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 949738CB1

<400> 18 

cccggagccg ccagaccgtc gcgcccctgc cccatcgtag tatatgagct cgcctacaca 60
aggacccccg ctaaaagcca gagctcccag tccccgaggc ttgaagacgg ggactccctt 120
ctccaccaac tctgtcctcg ggggggtggg gccccagccg agatcacagc gcgacaggag 180
tgggggtggc cgctggagac aggtgaagaa acaagaaaac taagaaatcc gagcggttgg 240
agggggagtc tgtgtggatg ggatggggac gccgggggag gggctgggcc gctgctccca 300
tgccctgatc cggggagtcc cagagagcct ggcgtcgggg gaaggtgcgg gggctggcct 360
tcccgctctg gatctggcca aagctcaaag ggagcacggg gtgctgggag gtaaactgag 420
gcaacgactg gggctacagc tgctagaact gccacctgag gagtcattgc cgctgggacc 480
gctgcttggc gacacggccg tgatccaagg ggacacggcc ctaatcacgc ggccctggag 540
ccccgctcgt aggccagagg tcgatggagt ccgcaaagcc ctgcaagacc tggggctccg 600
aattgtggaa ataggagacg agaacgcgac gctggatggc actgacgttc tcttcaccgg 660
ccgggagttt ttcgtaggcc tctccaaatg gaccaatcac cgaggagctg agatcgtggc 720
ggacacgttc cgggacttcg ccgtctccac tgtgccagtc tcgggtccct cccacctgcg 780
cggtctctgc ggcatggggg gacctcgcac tgttgtggca ggcagcagcg acgctgccca 840
aaaggctgtc cgggcaatgg cagtgctgac agatcaccca tatgcctccc tgaccctccc 900
agatgacgca gctgctgact gtctcttcct tcgtcctggg ttgcctggtg tgcccccttt 960
cctcctgcac cgtggaggtg gggatctgcc caacagccag gaggcactgc agaagctctc 1020
tgatgtcacc ctggtacctg tgtcctgctc agaactggag aaggccggcg ccgggctcag 1080
ctccctctgc ttggtgctca gcacacgccc ccacagctga gggcctggcc ttggggtact 1140
gctggccagg ggtaggatag tataggaagt agaaggggaa ggagggttag atagagaatg 1200
ctgaataggc agtagttggg agagaggg 1228 



<210> 19 

<211> 2155 

<212> DNA 

<213> Homo sapiens
<220> 

<221> misc_feature 

<223> Incyte ID No: 1297034CB1 

<400> 19 

cggctcgagc tcgcttctcg ttctactgcc ccaggagccc ggcgggtccg ggactcccgt 60 
ccgtgccggt gcgggcgccg gcatgtggct gtgggaggac cagggcggcc tcctgggccc 120
tttctccttc ctgctgctag tgctgctgct ggtgacgcgg agcccggtca atgcctgcct 180 
cctcaccggc agcctcttcg ttctactgcg cgtcttcagc tttgagccgg tgccctcttg 240 
cagggccctg caggtgctca agccccggga ccgcatttct gccatcgccc accgtggcgg 300 
cagccacgac gcgcccgaga acacgctggc ggccattcgg caggcagcta agaatggagc 360 
aacaggcgtg gagttggaca ttgagtttac ttctgacggg attcctgtct taatgcacga 420 
taacacagta gataggacga ctgatgggac tgggcgattg tgtgatttga catttgaaca 480 
aattaggaag ctgaatcctg cagcaaacca cagactcagg aatgatttcc ctgatgaaaa 540
gatccctacc ctaagggaag ctgttgcaga gtgcctaaac cataacctca caatcttctt 600 
tgatgtcaaa ggccatgcac acaaggctac tgaggctcta aagaaaatgt atatggaatt 660 
tcctcaactg tataataata gtgtggtctg ttctttcttg ccagaagtta tctacaagat 720 
gagacaaaca gatcgggatg taataacagc attaactcac agaccttgga gcctaagcca 780 
tacaggagat gggaaaccac gctatgatac tttctggaaa cattttatat ttgttatgat 840 
ggacattttg ctcgattgga gcatgcataa tatcttgtgg tacctgtgtg gaatttcagc 900 
tttcctcatg caaaaggatt ttgtatcccc ggcctacttg aagaagtggt cagctaaagg 960 
aatccaggtt gttggttgga ctgttaatac ctttgatgaa aagagttact acgaatccca 1020 
tcttggttcc agctatatca ctgacagcat ggtagaagac tgcgaacctc acttctagac 1080 
tttcacggtg ggacgaaacg ggttcagaaa ctgccagggg cctcatacag ggatatcaaa 1140 
ataccctttg tgctagccca ggccctgggg aatcaggtga ctcacacaaa tgcaatagtt 1200 
ggtcactgca tttttacctg aaccaaagct aaacccggtg ttgccaccat gcaccatggc 1260 
atgccagagt tcaacactgt tgctcttgaa aatctgggtc tgaaaaaacg cacaagagcc 1320 
cctgccctgc cctagctgag gcacacaggg agacccagtg aggataagca cagattgaat 1380 
tgtacaattt gcagatgcag atgtaaatgc atgggacatg catgataact cagagttgac 1440
attttaaaac ttgccacact tatttcaaat atttgtactc agctatgtta acatgtactg 1500 
tagacatcaa acttgtggcc atactaataa aattattaaa aggagcacta aaggaaaact 1560 
gtgtgccaag catcatatcc taaggcatac ggaatttggg gaagccacca tgcaatccag 1620 
tgaggcttca gtgtacagca accaaaatgg tagggaggtc ttgaagccaa tgagggattt 1680
atagcatctt gaatagagag ctgcaaacca ccagggggca gagttgcact tttccaggct 1740 
ttttaggaag ctctgcaaca gatgtgatct gatcataggc aattagaact ggaagaaact 1800 
tccaaaaata tctaggtttg tcctcatttt acaaatgagg aaactaaact ctgtggaagg 1860 
gaaggggttg cctcaaaagt cacagcttag ctgggcacag tggctcatgc cgataatccc 1920 
agcaattcag aaagctgagg caggaggatt acttgaggcc agactgggca atatagcaag 1980 
accccatctc taaaaaatta ggcatggtgg tgcatgcctg tattcccagc tactcaggag 2040 
gttgaggtgg gaggatcact tgagcccaga agttcaaggc tgcaatgagc catgattaca 2100 
ccacggcact acaaccttgg tggcacagtg agaacgcgac tcttaaaaaa aaaaa 2155 



<210> 20 
<211> 491 
<212> DNA
<213> Homo sapiens
<220>

<221> misc_feature

<223> Incyte ID No: 1553276CB1 

<400> 20 

gcccatggcc gcagccctgg cgctcgtggc gggggtcctg tcgggggcgg tgctgcccct 60 
ctggagcgcg cttccgcaat ataaaaagaa aatcacagac aggtgcttcc accactctga 120 
gtgctacagt ggctgctgcc tcatggactt ggactccggt ggagccttct gtgcccccag 180 
ggccagaata accatgatct gcttgcccca gtggttggaa ctcttcaagg gcagggattg 240 
catcatattc atctatgaag cacctacccc cagcttagta tctgcacata accaagggag 300 
ctaccaacat catctgccct tgccggatgg gcttgacgtg catatccaag gacttgatgt 360 
gttcccgccg gtgccatatg atttagagga agatgcaggc tggtcactgc tcccttgggg 420
ccataggccc tggttgccac caacttgctc caaatccagc tcctgagaca ttaaagtcac 480 
ttcctgtcaa a 491



<210> 21 
<211> 2101 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 1702211CB1 

<400> 21 

cccacgcgtc cgcttctgtc gaaccagttg taaggagaat ggagagagca gtgagagtgg 60
agtccggggt cctggtcggg gtggtctgtc tgctcctggc atgccctgcc acagccactg 120 
ggcccgaagt tgctcagcct gaagtagaca ccaccctggg tcgtgtgcga ggccggcagg 180 
tgggcgtgaa gggcacagac cgccttgtga atgtctttct gggcattcca tttgcccagc 240 
cgccactggg ccctgaccgg ttctcagccc cacacccagc acagccctgg gagggtgtgc 300 
gggatgccag cactgcgccc ccaatgtgcc tacaagacgt ggagagcatg aacagcagca 360 
gatttgtcct caacggaaaa cagcagatct tctccgtttc agaggactgc ctggtcctca 420
acgtctatag cccagctgag gtccccgcag ggtccggtag gccggtcatg gtatgggtcc 480 
atggaggcgc tctgataact ggcgctgcca cctcctacga tggatcagct ctggctgcct 540 
atggggatgt ggtcgtggtt acagtccagt accgccttgg ggtccttggc ttcttcagca 600 
ctggagatga gcatgcacct ggcaaccagg gcttcctaga tgtggtagct gctttgcgct 660 
gggtgcaaga aaacatcgcc cccttcgggg gtgacctcaa ctgtgtcact gtctttggtg 720 
gatctgccgg tgggagcatc atctctggcc tggtcctgtc cccagtggct gcagggctgt 780 
tccacagagc catcacacag agtggggtca tcaccacccc agggatcatc gactctcacc 840 
cttggcccct agctcagaaa atcgcaaaca ccttggcctg cagctccagc tccccggctg 900 
agatggtgca gtgccttcag cagaaagaag gagaagagct ggtccttagc aagaagctga 960
aaaatactat ctatcctctc accgttgatg gcactgtctt ccccaaaagc cccaaggaac 1020 
tcctgaagga gaagcccttc cactctgtgc ccttcctcat gggtgtcaac aaccatgagt 1080 
tcagctggct catccccagg ggctggggtc tcctggatac aatggagcag atgagccggg 1140
aggacatgct ggccatctca acacccgtct tgaccagtct ggatgtgccc cctgagatga 1200 
tgcccaccgt catagatgaa tacctaggaa gcaactcgga cgcacaagcc aaatgccagg 1260
cgttccagga attcatgggt gacgtattca tcaatgttcc caccgtcagt ttttcaagat 1320 
accttcgaga ttctggaagc cctgtctttt tctatgagtt ccagcatcga cccagttctt 1380 
ttgcgaagat caaacctgcc tgggtgaagg ctgatcatgg ggccgagggt gcttttgtgt 1440
tcggaggtcc cttcctcatg gacgagagct cccgcctggc ctttccagag gccacagagg 1500 
aggagaagca gctaagcctc accatgatgg cccagtggac ccactttgcc cggacagggg 1560 
accccaatag caaggctctg cctccttggc cccaattcaa ccaggcggaa caatatctgg 1620
agatcaaccc agtgccacgg gccggacaga agttcaggga ggcctggatg cagttctggt 1680
cagagacgct ccccagcaag atacaacagt ggcaccagaa gcagaagaac aggaaggccc 1740
aggaggacct ctgaggccag gcctgaacct tcttggctgg ggcaaaccac tcttcaagtg 1800
gtggcagagt cccagcacgg cagcccgcct ctccccctgc tgagacttta atctccacca 1860
gcccttaaag tgtcggccgc tctgtgactg gagttatgct cttttgaaat gtcacaaggc 1920
cgcctcccac ctctggggca ttgtacaagt tcttccctct ccctgaagtg cctttcctgc 1980
tttcttcgtg gtaggttcta gcacattcct ctagcttcct ggaggactca ctcccccagg 2040
aagccttccc tgccttctct gggctgtgcg gccccgagtc tgcgtccatt agagcacagt 2100
c 2101



<210> 22 
<211> 1834 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1859618CB1 

<400> 22 

gcccccagtc caggcaggta taaggccacc tccgcaggcc aggacaaccc agaagcaaaa 60
gagcagagct accatgtcct cttggagcag acagcgacca aaaagcccag ggggcattca 120
accccatgtt tctagaactc tgttcctgct gctgctgttg gcagcctcag cctggggggt 180 
caccctgagc cccaaagact gccaggtgtt ccgctcagac catggcagct ccatctcctg 240
tcaaccacct gccgaaatcc ccggctacct gccagccgac accgtgcacc tggccgtgga 300 
attcttcaac ctgacccacc tgccagccaa cctcctccag ggcgcctcta agctccaaga 360 
attgcacctc tccagcaatg ggctggaaag cctctcgccc gaattcctgc ggccagtgcc 420
gcagctgagg gtgctggatc taacccgaaa cgccctgacc gggctgcccc cgggcctctt 480
ccaggcctca gccaccctgg acaccctggt attgaaagaa aaccagctgg aggtcctgga 540
ggtctcgtgg ctacacggcc tgaaagctct ggggcatctg gacctgtctg ggaaccgcct 600 
ccggaaactg ccccccgggc tgctggccaa cttcaccctc ctgcgcaccc ttgaccttgg 660 
ggagaaccag ttggagacct tgccacctga cctcctgagg ggtccgctgc aattagaacg 720
gctacatcta gaaggcaaca aattgcaagt actgggaaaa gatctcctct tgccgcagcc 780 
ggacctgcgc tacctcttcc tgaacggcaa caagctggcc agggtggcag ccggtgcctt 840
ccagggcctg cggcagctgg acatgctgga cctctccaat aactcactgg ccagcgtgcc 900 
cgaggggctc tgggcatccc tagggcagcc aaactgggac atgcgggatg gcttcgacat 960 
ctccggcaac ccctggatct gtgaccagaa cctgagcgac ctctatcgtt ggcttcaggc 1020 
ccaaaaagac aagatgtttt cccagaatga cacgcgctgt gctgggcctg aagccgtgaa 1080
gggccagacg ctcctggcag tggccaagtc ccagtgagac caggggcttg ggttgagggt 1140
ggggggtctg gtagaacact gcaacccgct taacaaataa tcctgccttt ggccgggtgc 1200
gggggctcac gcctgtaatc ccagcacttt gggaggccca ggtgggcgga tcacgaggtc 1260
aggagatcga gaccatcttg gctaacatgg tgaaaccctg tctctactaa aaatataaaa 1320
aattagccag gcgtggtggt gggcacctgt agtcccagca actcgggagg ctgaggcagg 1380
agaatggcgt gaacttggga ggcggagctt gcggtgagcc aagatcgtgc cactgcactc 1440
tagcctgggc gacagagcaa gactgtctca aaaaaaatta aaattaaaat taaaaacaaa 1500 
taatcctgcc ttttacaggt gaaactcggg gctgtccata gcggctggga ccccgtttca 1560
tccatccatg cttcctagaa cacacgatgg gctttcctta cccatgccca aggtgtgccc 1620 
tccgtctgga atgccgttcc ctgtttccca gatctcttga actctgggtt ctcccagccc 1680 
cttgtccttc cttccagctg agccctggcc acactggggc tgcctttctc tgactctgtc 1740
ttccccaagt cagggggctc tctgagtgca gggtctgatg ctgagtccca cttagcttgg 1800
ggtcagaacc aaggggttta ataaataacc cttg 1834 



<210> 23
<211> 753
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2011071CB1 



<400> 23 

atgcaggacg ctcccctgag ctgcctgtca ccgactaggt ggagcagtgt ttcttccgca 60
gactcaactg agaagtcagc ctctggggca ggcaccagga atctgccttt tcagttctgt 120
ctccggcagg ctttgaggat gaaggctgcg ggcattctga ccctcattgg ctgcctggtc 180
acaggcgccg agtccaaaat ctacactcgt tgcaaactgg caaaaatatt ctcgagggct 240
ggcctggaca attactgggg cttcagcctt ggaaactgga tctgcatggc atattatgag 300
agcggctaca acaccacagc cccgacggtc ctggatgacg gcagcatcga ctatggcatc 360
ttccagatca acacgttcgc gtggtgcaga cgcggaaagc tgaaggagaa caaccactgc 420
catgtcgcct gctcagcctt gatcactgat gacctcacag atgcaattat ctgtgccagg 480
aaaattgtta aagagacaca aggaatgaac tattggcaag gctggaagaa acattgtgag 540
ggcagagacc tgtccgagtg gaaaaaaggc tgtgaggttt cctaaactgg aactggaccc 600
aggatgcttt gcagcaacgc cctaggattt gcagtgaatg tccaaatgcc tgtgtcatct 660
tgtcccgttt cctcccaata ttccttctca aacttggaga gggaaaatta agctatactt 720
ttaagaaaat aaatatttcc atttaaatgt caa 753



<210> 24 
<211> 1395 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2186517CB1 

<400> 24 

gcccccagca tggcttggca gggctggccc gcggcgtggc agtgggtcgc cggctgctgg 60
ctcctcctcg tccttgtcct cgtcctactt gtgagccccc gcggctgccg agcgcggcgg 120
ggcctccgcg gtctgctcat ggcgcacagc cagcggctgc tcttccgaat cgggtacagc 180
ctgtacaccc gcacctggct cgggtacctc ttctaccgac agcagctgcg cagggctcgg 240
aatcgctacc ctaaaggcca ctcgaaaacc cagacccgcc tcttcaatgg agtgaaggtg 300 
cttcccatcc ctgtcctctc ggacaactac agctacctca tcatcgacac ccaggcccag 360 
ctggctgtgg ctgtggaccc ttcagaccct cgggctgtgc aggcttccat tgaaaaggaa 420
ggggtcacct tggtcgccat tctgtgtact cacaagcact gggaccacag tggagggaac 480
cgtgacctca gccggcggca ccgggactgt cgggtgtacg ggagccctca ggacggcatc 540
ccctacctca cccatcccct gtgtcatcaa gatgtggtca gcgtgggacg gcttcagatc 600
cgggccctgg ctacacctgg ccacacacaa ggccatctgg tctacctact gg^tggggag 660
ccctacaagg gtccctcctg cctcttctca ggggacctgc tcttcctctc tggctgtggg 720
cggacctttg agggcaatgc agagaccatg ctgagctcac tggacactgt gctggggcta 780
ggggatgaca cccttctgtg gcctggtcat gagtatgcag aggagaacct gggctttgca 840
ggtgtggtgg agcccgagaa cctggcccgg gagaggaaga tgcagtgggt gcagcggcag 900 
cggctggagc gcaagggcac gtgcccatct accctgggag aggagcgctc ctacaacccg 960
ttcctgagaa cccactgcct ggcgctacag gaggctctgg ggccggggcc gggccccact 1020 
ggggatgatg actactcccg ggcccagctc ctggaagagc tccgccggct gaaggatatg 1080
cacaagagca agtgatgccc ccagcgcccc cagcccagcc cactccccgc atggggaggc 1140
cgccaccacc aacacctcat catccttctc atcgctaaca ccaccacctc catcggcacc 1200
caagcgggca tcatcccccc acactgctca ggggagggga gggatcaggc gatgagactg 1260
tgaggccaaa agaagggggc ctgttggagg ctgggaaccc cgcagcgcga ggctgcctca 1320
tcaacggcaa gaggaaagga ggggtctcgg gacatctcca gaccctacca actgggaggg 1380
tcccctcctc cttcc 1395



<210> 25 
<211> 1413 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2253585CB1 

<400> 25

gcgagccggc caacagcttg caagcatgct ccgctggacc cgagcctgga ggctcccgcg 60
tgagggactc ggcccccacg gccctagctt cgcgagggtg cctgtcgcac ccagcagcag 120
cagcggcggc cgagggggcg ccgagccgag gccgcttccg ctttcctaca ggcttctgga 180
cggggaggca gccctcccgg ccgtcgtctt tttgcacggg ctcttcggca gcaaaactaa 240
cttcaactcc atcgccaaga tcttggccca gcagacaggc cgtagggtgc tgacggtgga 300 
tgctcgtaac cacggtgaca gcccccacag cccagacatg agctacgaga tcatgagcca 360 
ggacctgcag gaccttctgc cccagctggg cctggtgccc tgcgtcgtcg ttggccacag 420
catgggagga aagacagcca tgctgctggc actacagagg ccagagctgg tggaacgtct 480 
cattgctgta gatatcagcc cagtggaaag cacaggtgtc tcccactttg caacctacgt 540
ggcagccatg agggccatca acatcgcaga tgagctgccc cgctcccgtg cccgaaaact 600 
ggcggatgaa cagctcagtt ctgtcatcca ggacatggcc gtgcggcagc acctgctcac 660 
taacctggta gaggtagacg ggcgcttcgt gtggagggtg aacttggatg ccctgaccca 720
gcacctagac aagatcttgg ctttcccaca gaggcaggag tcctacctcg ggccaacact 780
ctttctcctt ggtggaaact cccagttcgt gcatcccagc caccaccctg agattatgcg 840 
gctcttccct cgggcccaga tgcagacggt gccgaacgct ggccactgga tccacgctga 900 
ccgcccacag gacttcatag ctgccatccg aggcttcctg gtctaagagt tgctggcaag 960 
aagatggccg ggcgtggtgg ctcatgcctg taattccagc actttgggag gctaaggcgg 1020 
gaggatgact tgaggccagg agttggagac cagcctggcc aacatggtga aaccctgtct 1080 
ctactaaaaa tacaaaaatt agcctggcgt ggtggtgcac acctgtaatc ccagctactc 1140
gggaggctga ggcaggagaa tcacttgaac cctggaggca gaggttgcaa tgagccgaga 1200 
tcacaccact acactccagc ctgggcaaca gagcaagact ctgtctcaaa aaaaaacaaa 1260 
acaaaaagga ggcacaaaac cccaggcttc aagtctctgc agcctgctcc acatttgggc 1320
acagaaggac tcagacaggc actgtgtggg cacgaggttt tacaggggtg agtcagacct 1380
caggctttaa tgaataaagc actcagctat aaa 1413 



<210> 26 

<211> 1868 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 2447520CB1 

<400> 26 

ttggagcctg gagactgcta gctgcctggt tctttaagaa ccagccctgg tccagcccat 60 
tccgcaggcc agcaagcttc tgaaaagcaa acctaggaag tagctttcca acataaagtg 120
gaggtttcaa cacaggagac tttaagcaag ttccagtgtg tctatatttg gtctggctga 180 
tcggctggac tctggccttc cccgctcacg ttagcagaca gctctgccct agtgggcgct 240
tagcctgcga cggcagcccg agaggatgtc taacaagctt ctttctcccc acccccattc 300
agttgttctc aggtctgaat tcaaaatggc ctcatctcct gctgtccttc gagcgtcccg 360
gctgtaccaa tggagcctga agagttcggc gcagttcctg gggtctccac agctgaggca 420
ggttggtcag atcattaggg ttcctgctcg gatggcggcg acgctgatcc tggagcctgc 480
gggccgctgc tgctgggacg aaccggtgcg aatcgccgtg cgcggcctag ccccggagca 540
gccggtcacg ctgcgcgcgt ccctgcgcga cgagaagggc gcgcttttcc aggcccacgc 600
gcgctaccgc gccgacactc ttggcgagct ggacctggag cgcgcgcccg cgctgggcgg 660
cagcttcgcg gggcttgagc ccatggggct gctctgggcc ttggagcccg agaaaccttt 720
ggtgcggctg gtgaagcgcg acgtgcgaac gcccttggcc gtggagctgg aggtgctgga 780
tggccacgac cccgaccccg ggcggctgct gtgccagacg cggcacgagc gctacttcct 840
cccgcccggg gtgcggcgcg agccggtgcg cgtgggccgg gtgcgaggca cgctcttcct 900
gccgccagaa cctgggccct ttcctgggat tgtggacatg ttcggaactg gaggtggcct 960
gctggagtat cgggctagtc tgctggctgg gaagggtttt gctgtgatgg ctctggctta 1020
ttataactat gaagacctcc ccaagaccat ggagacgctc catctggagt actttgaaga 1080
agccatgaac tacttgctca gtcatcccga ggtaaaaggt ccaggagttg ggctgcttgg 1140
aatttccaaa gggggtg^gc tctgcctttc catggcctct ttcctgaagg gcatcacggc 1200
tgctgtcgtc atcaacggct ctgtggccaa tgttggggga accttacgct acaagggcga 1260
gaccctgccc cctgtgggcg tcaacagaaa tcgcatcaag gtgaccaaag atggctatgc 1320
agacattgtg gatgtcctga acagcccttt ggaaggacct gaccagaaga gcttcattcc 1380
tgtggaaagg gcagagagca ccttcctgtt cctggtaggt caggatgacc acaactggaa 1440
gagtgagttc tatgctaatg aggcctgtaa acgcttgcag gcccatggga ggagaaagcc 1500
ccagatcatc tgttacccag agacagggca ctatattgag cctccttact tccccctgtg 1560
tcgggcttcc ctgcatgcct tggtgggcag tcctattatc tggggagggg agcccagggc 1620
tcatgccatg gctcaggtgg atgcttggaa acaactccag actttcttcc acaaacactt 1680
gggtggccac gaggggacaa tcccatcaaa agtgtaaatt ttatttgatc atgtggcctc 1740
tctgttgcta atctctcctg gaaacatctg ccacatttag tgtgtgtatg tgtattcatt 1800
cttttgtttt taataactaa agttttttcc cctcattatt aaaatgaatt taccagtaaa 1860
aaaaaaaa 1868



<210> 27 

<211> 688 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 2481345CB1



<400> 27 

cggcgttact atcaagcaac caaactgcaa gctttgggag ttgttcgctg tccctgccct 60
gctctgctag ggagagaacg ccagagcgga ggcggctggc ccggcggcag gctctcagaa 120
ccgctaccgg cgatgctact gctgtgggtg tcggtggtcg cagccttggc gctggcggta 180
ctggcccccg gagcagggga gcagaggcgg agagcagcca aagcgcccaa tgtggtgctg 240
gtcgtgagcg actccttcga tggaaggtta acatttcatc caggaagtca ggtagtgaaa 300
cttcctttta tcaactttat gaagacacgt gggacttcct ttctgaatgc ctacacaaac 360
tctccaattt gttgcccatc acgcgcagca atgtggagtg gcctcttcac tcacttaaca 420
gaatcttgga ataattttaa gggtctagat ccaaattata caacatggat ggatgtcatg 480
gagaggcatg gctaccgaac acagaaattt gggaaactgg actatacttc aggacatcac 540
tccattagta atcgtgtgga agcgtgacaa gagatgttgc tttcttactc agacaagaag 600
gcaggcccat ggttaatctt atccgtaaca ggactaaagt cagagtgatg gaaagggatt 660
ggcagaatac agacaaagca gtaaactg 688



<210> 28
<211> 1375
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature

<223> Incyte ID No: 2484020CB1



<400> 28 

gcggggtggc ggcggccggg cccccacggc ggcggccgga gcagcagcag cagcagcagg 60
agcccgcctc tatgatgaag ttcaagccca accagacgcg gacctacgac cgcgagggct 120
tcaagaagcg ggcggcgtgc ctgtgcttcc ggagcgagca gg^ggacgag gtgctgctgg 180
tgagtagcag ccggtaccca gaccagtgga ttgtcccagg aggaggaatg gaacccgagg 240
aggaacctgg cggtgctgcc gtgagggaag tttatgagga ggctggagtc aaaggaaaac 300
taggcagact tctgggcata tttgagaacc aagaccgaaa gcacagaaca tatgtttatg 360
ttctaacagt cactgaaata ttagaagatt gggaagattc tgttaatatt ggaaggaaga 420
gagagtggtt caaagtagaa gatgctatca aagttctcca gtgtcataaa cctgtacatg 480
cagagtatct ggaaaagcta aagctgggtt gttccccagc caatggaaat tctacagtcc 540
cttcccttcc ggataataat gccttgtttg taaccgctgc acagacctct gggttgccat 600
ctagtgtaag atagagagaa ctgggtaggc ctctcccacc atgtgcagtc tcatggggag 660
aggcttcttt cgtttcctcg tcaaacatct gattgacgct tgcaaactgt ctgaatttgc 720
catgcaaggt tttcaaacaa tttgcatgtt tttcagatgc tttcaaatct ttttttaaaa 780
aaatagtgta aaatatttta ataagccaaa gccatgtgga atttttgttt agatgcctta 840
actgtgccac accccacaac cccctatatt attttggttg tctatttctc acagcatatt 900
ttcagttttt tgtccatttg acatcagtct gtggtttatt ttgtcatcag attacttgtg 960
ggtataccta ccccaaaatt gttttctcat tcacagcatt agcatattca gcaaatccat 1020
ctgtggtggg aattaaaaat attattggta ttaaagaaat ccattcaccc caaaacttgt 1080
tttacaggat tacaatttta attcaaaatt tccagatttg ggctatttct gtatgatcca 1140
ataacttatt ttgtcacagg gcttaatttg ccatttttgg ggatttgtcg actcattttg 1200
tctgaatttt cacaactggt attatgtcac tagctacctg atatggctat ttcccttata 1260
actcaatagt accttaacac aaagtataac tctgtagagt tggtgaatat tttagggaaa 1320
tattagcaaa atgcatgtag taaagacatc ttatgaaaac tgtaaaaaaa aaaaa 1375



<210> 29 
<211> 1390 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2862528CB1 



<400> 29 

catcgctcca cctctgccct ccccctttat ggcgcggccc gggctcattc attccgcgcc 60
gggcctgcca gacacctgcg cccttctgca gccgcccgcc gcatccgccg ccgcagcccc 120
cagcatgtcg ggcccagacg tcgagacgcc gtccgccatc cagatctgcc ggatcatgcg 180
gccagatgat gccaacgtgg ccggcaatgt ccacgggggg accatcctga agatgatcga 240
ggaggcaggc gccatcatca gcacccggca ttgcaacagc cagaacgggg agcgctgtgt 300
ggccgccctg gctcgtgtcg agcgcaccga cttcctgtct cccatgtgca tcggtgaggt 360
ggcgcatgtc agcgcggaga tcacctacac ctccaagcac tctgtggagg tgcaggtcaa 420
cgtgatgtcc gaaaacatcc tcacaggtgc caaaaagctg accaataagg ccaccctgtg 480
gtatgtgccc ctgtcgctga agaatgtgga caaggtcctc gaggtgcctc ctgttgtgta 540
ttcccggcag gagcaggagg aggagggccg gaagcggtat gaagcccaga agctggagcg 600
catggagacc aagtggagga acggggacat cgtccagcca gtcctcaacc caggtgtgac 660
catgaagctc atggatgagg tcgccgggat cgtggctgca cgccactgca agaccaacat 720
cgtcacagct tccgtggacg ccattaattt tcatgacaag atcagaaaag gctgcgtcat 780
caccatctcg ggacgcatga ccttcacgag caataagtcc atggagatcg aggtgttggt 840
ggacgccgac cctgttgtgg acagctctca gaagcgctac cgggccgcca gtgccttctt 900
cacctacgtg tcgctgagcc aggaaggcag gtcgctgcct gtgccccagc tggtgcccga 960
gaccgaggac gagaagaagc gctttgagga aggcaaaggg cggtacctgc agatgaaggc 1020
gaacgacagg gccacgcgga gcctcagccc tagactccct cctcctgcca ctggtgcctc 1080
gagtagccat ggcaacgggc ccagtgtcca gtcacttaga agttcccccc ttggccaaaa 1140
acccaattca cattgagagc tggtgttgtc tgaagttttc gtatcacagt gttaacctgt 1200
actctctcct gcaaacctac acaccaaagc tttatttata tcattccagt atcaatgcta 1260
cacagtgttg tcccgagcgc cgggaggcgt tgggcagaaa ccctcgggaa tgcttccgag 1320
cacgctgtag ggtatgggaa gaacccagca ccactaataa agctgctgct tggctggaaa 1380
aaaaaaaaaa 1390



<210> 30 
<211> 3038 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<223> Incyte ID No: 3200650CB1 



<400> 30 

gcgcggctga gtgcggactg gagtgggaac ccgggtcccc gcgcttagag aacacgcgat 60
gaccacgtgg agcctccggc ggaggccggc ccgcacgctg ggactcctgc tgctggtcgt 120
cttgggcttc ctggtgctcc gcaggctgga ctggagcacc ctggtccctc tgcggctccg 180
ccatcgacag ctggggctgc aggccaaggg ctggaacttc atgctggagg attccacctt 240
ctggatcttc gggggctcca tccactattt ccgtgtgccc agggagtact ggagggaccg 300
cctgctgaag atgaaggcct gtggcttgaa caccctcacc acctatgttc cgtggaacct 360
gcatgagcca gaaagaggca aatttgactt tctctgggaa acttggacct tgaaggcctt 420
cgtcctgatg gccgcagaga tcgggctgtg ggtgattctg cgtccaggcc cctacatctg 480
tagtgagatg gacctcgggg gcttgcccag ctggctactc caagaccctg gcatgaggct 540
gaggacaact tacaagggct tcaccgaagc agtggacctt tattttgacc acctgatgtc 600
cagggtggtg ccactccagt acaagcgtgg gggacctatc attgccgtgc aggtggagaa 660
tgaatatggt tcctataata aagaccccgc atacatgccc tacgtcaaga aggcactgga 720
ggaccgtggc attgtggaac tgctcctgac ttcagacaac aaggatgggc tgagcaaggg 780
gattgtccag ggagtcttgg ccaccatcaa cttgcagtca acacacgagc tgcagctact 840
gaccaccttt ctcttcaacg tccaggggac tcagcccaag atggtgatgg agtactggac 900
ggggtggttt gactcgtggg gaggccctca caatatcttg gattcttctg aggttttgaa 960
aaccgtgtct gccattgtgg acgccggctc ctccatcaac ctctacatgt tccacggagg 1020
caccaacttt ggcttcatga atggagccat gcacttccat gactacaagt cagatgtcac 1080
cagctatgac tatgatgctg tgctgacaga agccggcgat tacacggcca agtacatgaa 1140
gcttcgagac ttcttcggct ccatctcagg catccctctc cctcccccac ctgaccttct 1200
tcccaagatg ccgtatgagc ccttaacgcc agtcttgtac ctgtctctgt gggacgccct 1260
caagtacctg ggggagccaa tcaagtctga aaagcccatc aacatggaga acctgccagt 1320
caatggggga aatggacagt ccttcgggta cattctctat gagaccagca tcacctcgtc 1380
tggcatcctc agtggccacg tgcatgatcg ggggcaggtg tttgtgaaca cagtatccat 1440
aggattcttg gactacaaga caacgaagat tgctgtcccc ctgatccagg gttacaccgt 1500
gctgaggatc ttggtggaga atcgtgggcg agtcaactat ggggagaata ttgatgacca 1560
gcgcaaaggc ttaattggaa atctctatct gaatgattca cccctgaaaa acttcagaat 1620
ctatagcctg gatatgaaga agagcttctt tcagaggttc ggcctggaca aatggagttc 1680
cctcccagaa acacccacat tacctgcttt cttcttgggt agcttgtcca tcagctccac 1740
cccttgtgac acctttctga agctggaggg ctgggagaag ggggttgtat tcatcaatgg 1800
ccagaacctt ggacgttact ggaacattgg accccagaag acgctttacc tcccaggtcc 1860
ctggttgagc agcggaatca accaggtcat cgtttttgag gagacgatgg cgggccctgc 1920
attacagttc acggaaaccc cccacctggg caggaaccag tacattaagt gagcggtggc 1980
accccctcct gctggtgcca gtgggagact gccgcctcct cttgacctga agcctggtgg 2040
ctgctgcccc acccctcact gcaaaagcat ctccttaagt agcaacctca gggactgggg 2100
gctacagtct gcccctgtct cagctcaaaa ccctaagcct gcagggaaag gtgggatggc 2160
tctgggcctg gctttgttga tgatggcttt cctacagccc tgctcttgtg ccgaggctgt 2220
cgggctgtct ctagggtggg agcagctaat cagatcgccc agcctttggc cctcagaaaa 2280
agtgctgaaa cgtgcccttg caccggacgt cacagccctg cgagcatctg ctggactcag 2340
gcgtgctctt tgctggttcc tgggaggctt ggccacatcc ctcatggccc cattttatcc 2400
ccgaaatcct gggtgtgtca ccagtgtaga gggtggggaa ggggtgtctc acctgagctg 2460
actttgttct tccttcacaa ccttctgagc cttctttggg attctggaag gaactcggcg 2520
tgagaaacat gtgacttccc ctttcccttc ccactcgctg cttcccacag ggtgacaggc 2580
tgggctggag aaacagaaat cctcaccctg cgtcttccca agttagcagg tgtctctggt 2640
gttcagtgag gaggacatgt gagtcctggc agaagccatg gcccatgtct gcacatccag 2700
ggaggaggac agaaggccca gctcacatgt gagtcctggc agaagccatg gcccatgtct 2760
gcacatccag ggaggaggac agaaggccca gctcacatgt gagtcctggc agaagccatg 2820
gcccatgtct gcacatccag ggaggaggac agaaggccca gctcagtggc ccccgccccc 2880
caccccccac gcccgaacag caggggcaga gcagccctcc ttcgaagtgt gtccaagtcc 2940
gcatttgagc cttgttctgg ggcccagccc aacacctggc ttgggctcac tgtcctgagt 3000
tgcagtaaag ctataacctt gaatcacaaa aaaaaaaa 3038



<210> 31 

<211> 1340 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4107621CB1 



<400> 31 

gcgacctagc caggcgtgag ggagtgacag cagcgcattc gcgggacgag agcgatgagt 60
gagaacgccg caccaggtct gatctcagag ctgaagctgg ctgtgccctg gggccacatc 120
gcagccaaag cctggggctc cctgcagggc cctccagttc tctgcctgca cggctggctg 180
gacaatgcca gctccttcga cagactcatc cctcttctcc cgcaagactt ttattacgtt 240
gccatggatt tcggaggtca tgggctctcg tcccattaca gcccaggtgt cccatattac 300
ctccagactt ttgtgagtga gatccgaaga gttgtggcag ccttgaaatg gaatcgattc 360
tccattctgg gccacagctt cggtggcgtc gtgggcggaa tgtttttctg taccttcccc 420
gagatggtgg ataaacttat cttgctggac acgccgctct ttctcctgga atcagatgaa 480
atggagaact tgctgaccta caagcggaga gccatagagc acgtgctgca ggtagaggcc 540
tcccaggagc cctcgcacgt gttcagcctg aagcagctgc tgcagaggtt actgaagagc 600
aatagccact tgagtgagga gtgcggggag cttctcctgc aaagaggaac cacgaaggtg 660
gccacaggtc tggttctgaa cagagaccag aggctcgcct gggcagagaa cagcattgac 720
ttcatcagca gggagctgtg tgcgcattcc atcaggaagc tgcaggccca tgtcctgttg 780
atcaaagcag tccacggata ttttgattca agacagaatt actctgagaa ggagtccctg 840
tcgttcatga tagacacgat gaaatccacc ctcaaagagc agttccagtt tgtggaagtc 900
ccaggcaatc actgtgtcca catgagcgaa ccccagcacg tggccagtat catcagctcc 960
ttcttacagt gcacacacat gctcccagcc cagctgtagc tctgggcctg gaactatgaa 1020
gacctagtgc tcccagactc aacactggga ctctgagttc ctgagcccca caacaaggcc 1080
agggatggtg gggacaggcc tcactagtct tgaggcccag cctaggatgg tagtcagggg 1140
aaggagcgag attccaactt caacatctgt gacctcaaga gggagacaga gtctgggttc 1200
cagggctgct ttctcctggc taataataaa tatccagcca gctggaggaa ggaagggcag 1260
gctgggccca cctagccttt ccctgctgcc caactggatg gaaaataaaa ggttcttgta 1320
ttctcaaaaa aaaaaaaaaa 1340



<210> 32 

<211> 1717 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature

<223> Incyte ID No: 4661133CB1 



<400> 32 

cgcccccgga caccgctgtc ccgctcccgg gctgtcctca gcaagggcgc ggtctggtac 60
tcgtgcgtct tttatcgcct cagtttccct ccgccgacta gcgcgcgggg cccggttctc 120
catcgcgcgc acggcagcct agcgcaatga ggcgggcagc actgcggctt tgtgccttgg 180
gcaaagggca gcttactcct ggaagaggac tgactcaagg accccagaac cccaagaaac 240
agggaatctt ccacattcat gaagcatgtt catctataca tgtgaatcat gttcgagata 300
agttgcggga gatagtagga gcatccacaa actggagaga ccatgtgaag gcaatggaag 360
aaaggaaatt acttcatagt ttcttggcta aatcacagga tggactgcct cctaggagaa 420
tgaaggacag ttatattgaa gttctcttgc ctttgggcag tgagcctgaa ttacgagaga 480
aatatttgac tgttcaaaac accgtaagat ttggcaggat tcttgaggat cttgacagct 540
tgggagttct tatttgttac atgcacaaca aaatccactc cgccaagatg tctcctttat 600
cgatagttac agccctggtg gataagattg atatgtgtaa gaagagcttg agcccagaac 660
aggacattaa gttcagtggc catgttagct gggtcgggaa gacatccatg gaagtgaaga 720
tgcaaatgtt ccagttacat ggtgatgaat tttgtcctgt tttggatgca acatttgtaa 780
tggtggctcg tgattctgaa aataaagggc cggcatttgt aaatccactc atccctgaaa 840
gcccagagga agaggagctc tttagacaag gggaattgaa caaggggaga agaattgcct 900
tcagctccac gtcgttactg aaaatggccc ccagcgctga ggagaggacc accatacatg 960
agatgtttct cagcacactg gatccaaaga ctataagttt tcggagtcga gttttaccct 1020
ctaatgcagt gtggatggag aattcaaaac tgaagagttt ggaaatttgc caccctcagg 1080
agcggaacat tttcaatcgg atctttggtg gtttccttat gaggaaggca tatgaacttg 1140
cgtgggctac tgcttgtagc tttggtggtt ctcgaccgtt tgtggtagca gtagatgaca 1200
tcatgtttca gaaacctgtt gaggttggct cattgctctt tctttcttca caggtatgct 1260
ttactcagaa taattatatt caagtcagag tacacagtga agtggcctcc ctgcaggaga 1320
agcagcatac aaccaccaat gtctttcatt tcacgttcat gtcggaaaaa gaagtgccat 1380
tggttttccc aaaaacatat ggagagtcca tgttgtactt agatgggcag cggcatttca 1440
actccatgag tggcccagcg accttgagaa aggactacct tgtggagccc taagaacacc 1500
acatttgttg aaaactagca ctctacccac agtgacgtgg tatctgatga agacctgatc 1560
gagtgtattg attttagtat tgcttcgtgt cctccacaca ggaggaggat gtattcagcc 1620
tttaggatga tcagaaaagc agaaagagag agtggccgga tggggctgag gggagaaaga 1680
attattaaac aataaatact ttcaagacaa aaaaaaa 1717



<210> 33 
<211> 148 
<212> PRT 

<213> Colobus guereza 
<300> 

<308> GenBank ID No: g1790927



<400> 33 

Met Lys Ala Leu Ile Ile Leu Gly Leu Val Leu Leu Ser Val Thr
1               5                   10                  15

Val Gln Gly Lys Ile Phe Glu Arg Cys Glu Leu Ala Arg Thr Leu
                20                  25                  30

Lys Lys Leu Gly Leu Asp Gly Tyr Lys Gly Val Ser Leu Ala Asn
                35                  40                  45

Trp Val Cys Leu Ala Lys Trp Glu Ser Gly Tyr Asn Thr Asp Ala
                50                  55                  60

Thr Asn Tyr Asn Pro Gly Asp Glu Ser Thr Asp Tyr Gly Ile Phe
                65                  70                  75

Gln Ile Asn Ser Arg Tyr Trp Cys Asn Asn Gly Lys Thr Pro Gly
                80                  85                  90

Ala Val Asn Ala Cys His Ile Ser Cys Asn Ala Leu Leu Gln Asn
                95                  100                 105

Asn Ile Ala Asp Ala Val Ala Cys Ala Lys Arg Val Val Ser Asp
                110                 115                 120

Pro Gln Gly Ile Arg Ala Trp Val Ala Trp Lys Lys His Cys Gln
                125                 130                 135

Asn Arg Asp Val Ser Gln Tyr Val Glu Gly Cys Gly Val
                140                 145



<210> 34 
<211> 148 
<212> PRT 

<213> Colobus angolensis 
<300> 

<308> GenBank ID No: g1790967
<400> 34 

Met Lys Ala Leu Ile Ile Leu Gly Leu Val Leu Leu Ser Val Thr
1               5                   10                  15

Val Gln Gly Lys Ile Phe Glu Arg Cys Glu Leu Ala Arg Thr Leu
                20                  25                  30

Lys Lys Leu Gly Leu Asp Gly Tyr Lys Gly Val Ser Leu Ala Asn
                35                  40                  45

Trp Val Cys Leu Ala Lys Trp Glu Ser Gly Tyr Asn Thr Asp Ala
                50                  55                  60

Thr Asn Tyr Asn Pro Gly Asp Glu Ser Thr Asp Tyr Gly Ile Phe
                65                  70                  75

Gln Ile Asn Ser Arg Tyr Trp Cys Asn Asn Gly Lys Thr Pro Gly
                80                  85                  90

Ala Val Asn Ala Cys His Ile Ser Cys Asn Ala Leu Leu Gln Asn
                95                  100                 105

Asn Ile Ala Asp Ala Val Ala Cys Ala Lys Arg Val Val Ser Asp
                110                 115                 120

Pro Gln Gly Ile Arg Ala Trp Val Ala Trp Lys Lys His Cys Gln
                125                 130                 135

Asn Arg Asp Val Ser Gln Tyr Val Glu Gly Cys Gly Val
                140                 145



<210> 35 
<211> 148 
<212> PRT 
<213> Nasalis larvatus
<300> misc_feature
<308> GenBank ID No: gl7909l 14

<400> 35

Met Lys Ala Leu Ile Ile Leu Gly Leu Val Leu Leu Ser Val Thr
1               5                   10                  15

Val Gln Gly Lys Ile Phe Glu Arg Cys Glu Leu Ala Arg Thr Leu
                20                  25                  30

Lys Lys Leu Gly Leu Asp Gly Tyr Lys Gly Val Ser Leu Ala Asn
                35                  40                  45

Trp Val Cys Leu Ala Lys Trp Glu Ser Gly Tyr Asn Thr Glu Ala
                50                  55                  60

Thr Asn Tyr Asn Pro Gly Asp Glu Ser Thr Asp Tyr Gly Ile Phe
                65                  70                  75

Gln Ile Asn Ser Arg Tyr Trp Cys Asn Asn Gly Lys Thr Pro Gly
                80                  85                  90

Ala Val Asp Ala Cys His Ile Ser Cys Ser Ala Leu Leu Gln Asn
                95                  100                 105

Asn Ile Ala Asp Ala Val Ala Cys Ala Lys Arg Val Val Ser Asp
                110                 115                 120

Pro Gln Gly Ile Arg Ala Trp Val Ala Trp Arg Asn His Cys Gln
                125                 130                 135

Asn Arg Asp Val Ser Gln Tyr Val Lys Gly Cys Gly Val
                140                 145



